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This is an introduction to abstract algebra. It is anticipated that the students have studied 
calculus and probably linear algebra. However, these are primarily mathematical ma- 
turity prerequisites; subject matter from calculus and linear algebra appears mostly in 
illustrative examples and exercises. 

As in previous editions of the text, my aim remains to teach students as much about 
groups, rings, and fields as I can in a first course. For many students, abstract algebra is 
their first extended exposure to an axiomatic treatment of mathematics. Recognizing this, 
I have included extensive explanations concerning what we are trying to accomplish, 
how we are trying to do it, and why we choose these methods. Mastery of this text 
constitutes a firm foundation for more specialized work in algebra, and also provides 
valuable experience for any further axiomatic study of mathematics. 


Changes from the Sixth Edition 


The amount of preliminary material had increased from one lesson in the first edition 
to four lessons in the sixth edition. My personal preference is to spend less time before 
getting to algebra; therefore, I spend little time on preliminaries. Much of it is review for 
many students, and spending four lessons on it may result in their not allowing sufficient 
time in their schedules to handle the course when new material arises. Accordingly, in 
this edition, I have reverted to just one preliminary lesson on sets and relations, leaving 
other topics to be reviewed when needed. A summary of matrices now appears in the 
Appendix. 

The first two editions consisted of short, consecutively numbered sections, many of 
which could be covered in a single lesson. I have reverted to that design to avoid the 
cumbersome and intimidating triple numbering of definitions, theorems examples, etc. 
In response to suggestions by reviewers, the order of presentation has been changed so 
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that the basic material on groups, rings, and fields that would normally be covered in a 
one-semester course appears first, before the more-advanced group theory. Section 1 is 
a new introduction, attempting to provide some fecling for the nature of the study. 

In response to several requests, I have included the material on homology groups 
in topology that appeared in the first two editions. Computation of homology groups 
strengthens students’ understanding of factor groups. The material is easily accessible; 
after Sections 0 through 15, one need only read about free abelian groups, in Section 38 
through Theorem 38.5, as preparation. To make room for the homology groups, T have 
omitted the discussion of automata, binary linear codes, and additional algebraic struc- 
tures that appeared in the sixth edition. 

I have also included a few exercises asking students to give a one- or two-sentence 
synopsis of a proof in the text. Before the first such exercise, I give an example to show 
what I expect. 


Some Features Retained 


I continue to break down most exercise sets into parts consisting of computations, con- 
cepts, and theory. Answers to odd-numbered exercises not requesting a proof again 
appear at the back of the text. However, in response to suggestions, | am supplying the 
answers to parts a), c), e), g), and i) only of my 10-part true—false exercises. 

The excellent historical notes by Victor Katz are, of course, retained. Also, a manual 
containing complete solutions for all the exercises, including solutions asking for proofs, 
is available for the instructor from the publisher. 

A dependence chart with section numbers appears in the front matter as an aid in 
making a syllabus. 
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Suggestions for New Instructors of Algebra 


Those who have taught algebra several times have discovered the difficulties and devel- 
oped their own solutions. The comments I make here are not for them. 

This course is an abrupt change from the typical undergraduate calculus for the 
students. A graduate-style lecture presentation, writing out definitions and proofs on the 
board for most of the class time, will not work with most students. I have found it best 
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to spend at least the first half of each class period answering questions on homework, 
trying to get a volunteer to give a proof requested in an exercise, and generally checking 
to see if they seem to understand the material assigned for that class. Typically, I spent 
only about the last 20 minutes of my 50-minute time talking about new ideas for the next 
class, and giving at least one proof. From a practical point of view, it is a waste of time 
to try to write on the board all the definitions and proofs. They are in the text. 

I suggest that at least half of the assigned exercises consist of the computational 
ones. Students are used to doing computations in calculus. Although there are many 
exercises asking for proofs that we would love to assign, I recommend that you assign 
at most two or three such exercises, and try to get someone to explain how each proof is 
performed in the next class. I do think students should be asked to do at least one proof 
in each assignment. 

Students face a barrage of definitions and theorems, something they have never 
encountered before. They are not used to mastering this type of material. Grades on tests 
that seem reasonable to us, requesting a few definitions and proofs, are apt to be low and 
depressing for most students. My recommendation for handling this problem appears in 
my article, Happy Abstract Algebra Classes, in the November 2001 issue of the MAA 
FOCUS. 

At URI, we have only a single semester undergraduate course in abstract algebra. 
Our semesters are quite short, consisting of about 42 50-minute classes. When I taught 
the course, I gave three 50-minute tests in class, leaving about 38 classes for which the 
student was given an assignment. I always covered the material in Sections 0-11, 13-15, 
18-23, 26, 27, and 29-32, which is a total of 27 sections. Of course, I spent more than 
one class on several of the sections, but I usually had time to cover about two more; 
sometimes I included Sections 16 and 17. (There is no point in doing Section 16 unless 
you do Section 17, or will be doing Section 36 later.) I often covered Section 25, and 
sometimes Section 12 (see the Dependence Chart). The job is to keep students from 
becoming discouraged in the first few weeks of the course. 


This course may well require a different approach than those you used in previous math- 
ematics courses. You may have become accustomed to working a homework problem by 
turning back in the text to find a similar problem, and then just changing some numbers. 
That may work with a few problems in this text, but it will not work for most of them. 
This is a subject in which understanding becomes all important, and where problems 
should not be tackled without first studying the text. 

Let me make some suggestions on studying the text. Notice that the text bristles 
with definitions, theorems, corollaries, and examples. The definitions are crucial. We 
must agree on terminology to make any progress. Sometimes a definition is followed 
by an example that illustrates the concept. Examples are probably the most important 
aids in studying the text. Pay attention to the examples. 1 suggest you skip the proofs 
of the theorems on your first reading of a section, unless you are really “gung-ho” on 
proofs. You should read the statement of the theorem and try to understand just what it 
means. Often, a theorem is followed by an example that illustrates it, a great aid in really 
understanding what the theorem says. 

In summary, on your first reading of a section, I suggest you concentrate on what 
information the section gives, and on gaining a real understanding of it. If you do not 
understand what the statement of a theorem means, it will probably be meaningless for 
you to read the proof. 

Proofs are very basic to mathematics. After you fee! you understand the information 
given in a section, you should read and try to understand at least some of the proofs. 
Proofs of corollaries are usually the easiest ones, for they often follow very directly from 
the theorem. Quite a lot of the exercises under the “Theory” heading ask for a proof. Try 
not to be discouraged at the outset. It takes a bit of practice and experience. Proofs in 
algebra can be more difficult than proofs in geometry and calculus, for there are usually 
no suggestive pictures that you can draw. Often, a proof falls out easily if you happen to 
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look at just the right expression. Of course, it is hopeless to devise a proof if you do not 
really understand what it is that you are trying to prove. For example, if an exercise asks 
you to show that given thing is a member of a certain set, you must know the defining 
criterion to be a member of that set, and then show that your given thing satisfies that 
criterion. 

There are several aids for your study at the back of the text. Of course, you will 
discover the answers to odd-numbered problems not requesting a proof. If you run into a 
notation such as Z,, that you do not understand, look in the list of notations that appears 
after the bibliography. If you run into terminology like inner automorphism that you do 
not understand, look in the Index for the first page where the term occurs. 

In summary, although an understanding of the subject is important in every mathe- 
matics course, it is really crucial to your performance in this course. May you find it a 
rewarding experience. 


Narragansett, RI IBF. 
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0 SETS AND RELATIONS 


On Definitions, and the Notion of a Set 


Many students do not realize the great importance of definitions to mathematics. This 
importance stems from the need for mathematicians to communicate with each other. 
If two people are trying to communicate about some subject, they must have the same 
understanding of its technical terms. However, there is an important structural weakness. 


It is impossible to define every concept. 


Suppose, for example, we define the term set as “A set is a well-defined collection of 
objects.” One naturally asks what is meant by a collection. We could define it as “A 
collection is an aggregate of things.” What, then, is an aggregate? Now our language 
is finite, so after some time we will run out of new words to use and have to repeat 
some words already examined. The definition is then circular and obviously worthless. 
Mathematicians realize that there must be some undefined or primitive concept with 
which to start. At the moment, they have agreed that ser shall be such a primitive concept. 
We shall not define sez, but shall just hope that when such expressions as “the set of all 
real numbers” or “the set of all members of the United States Senate” are used, people’s 
various ideas of what is meant are sufficiently similar to make communication feasible. 
We summarize briefly some of the things we shall simply assume about sets. 


1. A set Sis made up of elements, and if a is one of these elements, we shall 
denote this fact by a € S. 


2. There is exactly one set with no elements. It is the empty set and is denoted 
by @. 

3. We may describe a set either by giving a characterizing property of the 
elements, such as “the set of all members of the United States Senate,’ or by 
listing the elements. The standard way to describe a set by listing elements is 
to enclose the designations of the elements, separated by commas, in braces, 
for example, {1, 2, 15}. If a set is described by a characterizing property P(x) 
of its elements x, the brace notation {x | P(x)} is also often used, and is read 
“the set of all x such that the statement P(x) about x is true.” Thus 


{2, 4, 6, 8} = {x |x is an even whole positive number < 8} 
= {2x |x = 1, 2,3, 4). 


The notation {x | P(x)} is often called “set-builder notation.” 

4. A set is well defined, meaning that if S is a set and a is some object, then 
either a is definitely in S, denoted by a € S, or a is definitely not in S, denoted 
by a ¢ S. Thus, we should never say, “Consider the set S of some positive 
numbers,” for it is not definite whether 2 € S or 2 ¢ S. On the other hand, we 


1 


2 Section 0 


0.1 Definition 


0.2 Definition 


Sets and Relations 


can consider the set T of all prime positive integers. Every positive integer is 
definitely either prime or not prime. Thus 5 € T and 14 ¢ T. It may be hard to 
actually determine whether an object is in a set. For example, as this book 
goes to press it is probably unknown whether 22°) + 1 is in T. However, 
22°) 4 | is certainly either prime or not prime. 


It is not feasible for this text to push the definition of everything we use all the way 
back to the concept of a set. For example, we will never define the number zr in terms of 
a set. . 


Every definition is an if and only if type of statement. 


With this understanding, definitions are often stated with the only if suppressed, but it 
is always to be understood as part of the definition. Thus we may define an isosceles 
triangle as follows: “A triangle is isosceles if it has two sides of equal length,” when we 
really mean that a triangle is isosceles if and only if it has two sides of equal length. 

In our text, we have to define many terms. We use specifically labeled and numbered 
definitions for the main algebraic concepts with which we are concerned. To avoid an 
overwhelming quantity of such labels and numberings, we define many terms within the 
body of the text and exercises using boldface type. 


Boldface Convention 
A term printed in boldface in a sentence is being defined by that sentence. 


Do not feel that you have to memorize a definition word for word. The important 
thing is to understand the concept, so that you can define precisely the same concept 
in your own words. Thus the definition “An isosceles triangle is one having two equal 
sides” is perfectly correct. Of course, we had to delay stating our boldface convention 
until we had finished using boldface in the preceding discussion of sets, because we do 
not define a set! 

In this section, we do define some familiar concepts as sets, both for illustration and 
for review of the concepts. First we give a few definitions and some notation. 


A set B is a subset of a set A, denoted by B C Aor A D2 B, if every element of B is in 
A. The notations B C Aor A D B will be used for B C A but B FA. |_| 


- Note that according to this definition, for any set A, A itself and @ are both subsets of A. 


If A is any set, then A is the improper subset of A. Any other subset of A is a proper 
subset of A. a 


0.3 Example 
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0.6 Example 
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Let S = {1,2,3}. This set S has a total of eight subsets, namely 4, {1}, {2}, 3}, 
{1, 2}, {1, 3}, {2,3}, and {1, 2, 3}. 


Let A and B be sets. The set A x B = {(a,b)|a € A and b € B} is the Cartesian 
product of A and B. | 


If A = {1, 2, 3} and B = {3, 4}, then we have 
Ax B= {(1,3),. 4), (2, 3), 2, 4), GB. 3), 3, H}. A 


Throughout this text, much work will be done involving familiar sets of numbers. 
Let us take care of notation for these sets once and for all. 
Z is the set of all integers (that is, whole numbers: positive, negative, and zero). 


Qis the set of all rational numbers (that is, numbers that can be expressed as quotients 
m/n of integers, where n 4 0). 


R is the set of all real numbers. 

Z*, Q*, and R® are the sets of positive members of Z, Q, and R, respectively. 

C is the set of all complex numbers. 

Z*, Q*, R*, and C* are the sets of nonzero members of Z, Q, R, and C, respectively. 


The set R x R is the familiar Euclidean plane that we use in first-semester calculus to 
draw graphs of functions. A 


Relations Between Sets 


We introduce the notion of an element a of set A being related to an element b of set B, 
which we might denote by a .# b. The notation a .# b exhibits the elements a and b in 
left-to-right order, just as the notation (a, b) for an element in A x B. This leads us to 
the following definition of a relation. as a set. 


A relation between sets A and B is a subset. #7 of A x B. We read (a, b) € Has “a is 
related to b” and write a .# b. | 


(Equality Relation) There is one familiar relation between a set and itself that we 
consider every set $ mentioned in this text to possess: namely, the equality relation = 
defined on a set S by 


= is the subset {(x, x) |x € S}of S x S. 
Thus for any x € S, we have x = x, but if x and y are different elements of S, then 
(x, y) € =and we write x A y. A 


We will refer to any relation between a set 5 and itself, as in the preceding example, 
as a relation on S. 


The graph of the function ( where f(x) = x? forallx € R, is the subset {(x, x*) |x € R} 
of IR x R. Thus it is a relation on R. The function is completely determined by its graph. 
A 
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The preceding example suggests that rather than define a “function” y = f(x) to 
be a “rule” that assigns to each x € R exactly one y € R, we can casily describe it as a 
certain type of subset of R x R, that is, as a type of relation. We free ourselves from R 
and deal with any sets X and Y. 


A function ¢ mapping X into Y is a relation between X and Y with the property that 
each x € X appears as the first member of exactly one ordered pair (x, y) in ¢. Such a 
function is also called a map or mapping of X into Y. We write @ : X — Y and express 
(x, y) € @ by ¢(x) = y. The domain of ¢ is the set X and the set Y is the codomain of 
¢. The range of ¢ is @[X] = {d(x) |x € X}. | 


We can view the addition of real numbers as a function + : (R x R) > R, that is, as a 
mapping of R x R into R. For example, the action of + on (2,3) € R x R is given in 
function notation by +((2, 3)) = 5. In set notation we write ((2, 3),5) € +. Of course 
our familiar notation is 2 +3 = 5. A 


Cardinality 


The number of elements in a set X is the cardinality of X and is often denoted by |X. 
For example, we have |{2, 5, 7}| = 3. It will be important for us to know whether two sets 
have the same cardinality. If both sets are finite there is no problem; we can simply count 
the elements in cach set. But do Z, Q, and R have the same cardinality? To convince 
ourselves that two sets X and Y have the same cardinality, we try to exhibit a pairing of 
each x in X with only one y in Y in such a way that each element of Y is also used only 
once in this pairing. For the sets X = {2, 5, 7} and Y = {?, ! #}, the pairing 
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shows they have the same cardinality. Notice that we could also exhibit this pairing as 
{(2, 2), (5, #), (7, !)} which, as a subset of X x Y, is a relation between X and Y. The 
pairing 

1 2 3 4 5 6 7 8 9 10 

t { 4 t ¢ t ¢ t ¢ ¢ 

0 —1 1 —2 2 —3 3 —4 4 


shows that the sets Z and Zt have the same cardinality. Such a pairing, showing that 
sets X and Y have the same cardinality, is a special type of relation <> between X and 
Y called a one-to-one correspondence. Since each element x of X appears precisely 
once in this relation, we can regard this one-to-one correspondence as a function with 
domain X. The range of the function is Y because each y in Y also appears in some 
pairing x <> y. We formalize this discussion in a definition. 


*A function @ : X — Y is one to one if 6(%)) = @(x2) only when x1 = x2 (see Exer- 
cise 37). The function ¢ is onto Y if the range of ¢ is Y. | 


*We should mention another terminology, used by the disciples of N. Bourbaki, in case you encounter it 
elsewhere. In Bourbaki’s terminology, a one-to-one map is an injection, an onto map is a surjection, and a 
map that is both one to one and onto is a bijection. 
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Ifa subset of X x Y is a one-to-one function @ mapping X onto Y, theneach x € X 
appears as the first member of exactly one ordered pair in @ and also each y € Y appears 
as the second member of exactly one ordered pair in @. Thus tf we interchange the first 
and second members of all ordered pairs (x, y) in @ to obtain a set of ordered pairs (y, x), 
we get a subset of Y x X, which gives a one-to-one function mapping Y onto X. This 
function is called the inverse function of @, and is denoted by @~!. Summarizing, if 
@ maps X one to one onto Y and (x) = y, then @-! maps Y one to one onto X, and 


oy) =x. 


Two sets X and Y have the same cardinality if there exists a one-to-one function mapping 
X onto Y, that is, if there exists a one-to-one correspondence between X and Y. | 


The function f : R — R where f(x) = x? is not one to one because f(2) = f(—2) =4 
but 2 4 —2. Also, itis not onto R because the range is the proper subset of all nonnegative 
numbers in R. However, g : R — R defined by g(x) = x° is both one to one and onto 
R. A 


We showed that Z and Z* have the same cardinality. We denote this cardinal number 
by Xo, so that |Z| = |Zt| = Xp. It is fascinating that a proper subset of an infinite set 
may have the same number of elements as the whole set; an infinite set can be defined 
as a set having this property. 

We naturally wonder whether all infinite sets have the same cardinality as the set Z. 
A set has cardinality &o if and only if all of its elements could be listed in an infinite row, 
so that we could “number them” using Z*. Figure 0.15 indicates that this is possible 
for the set Q. The square array of fractions extends infinitely to the right and infinitely 
downward, and contains all members of Q. We have shown a string winding its way 
through this array. Imagine the fractions to be glued to this string. Taking the beginning 
of the string and pulling to the left in the direction of the arrow, the string straightens 
out and all elements of Q appear on it in an infinite row as 0, 5, -3, 1,-1, 3, -.+, Thus 
|Q| = Xo also. 
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If the set S = {x € R|0 < x < 1} hascardinality No, all its elements could be listed 
as unending decimals in a column extending infinitely downward, perhaps as 


0.3659663426- - - 
0.7103958453 - -- 
0.0358493553 --- 
0.9968452214 -- - 


We now argue that any such array must omit some number in S. Surely S contains a 
number r having as its nth digit after the decimal point a number different from 0, from 9, 
and from the nth digit of the nth number in this list. For example, r might start 637+ -+. 
The 5 rather than 3 after the decimal point shows r cannot be the first number in S 
listed in the array shown. The 6 rather than 1 in the second digit shows r cannot be the 
second number listed, and so on. Because we could make this argument with any list, 
we see that S has too many elements to be paired with those in Z* . Exercise 15 indicates 
that IR has the same number of elements as S. We just denote the cardinality of R by 
R|. Exercise 19 indicates that there are infinitely many different cardinal numbers even 
greater than |RI. 


Partitions and Equivalence Relations 


Sets are disjoint if no two of them have any element in common. Later we will have 
occasion to break up a set having an algebraic structure (e.g., a notion of addition) into 
disjoint subsets that become elements in a related algebraic structure. We conclude this 
section with a study of such breakups, or partitions of sets. 


A partition of a set S is a collection of nonempty subsets of § such that every element 
of S is in exactly one of the subsets. The subsets are the cells of the partition. a 


When discussing a partition of a set S, we denote by X the cell containing the clement 
x of S. 


Splitting Z* into the subset of even positive integers (those divisible by 2) and the subset 
of odd positive integers (those leaving a remainder of 1 when divided by 2), we obtain 
a partition of Z* into two cells. For example, we can write 


14 = {2, 4, 6,8, 10, 12, 14, 16, 18, ---}. 


We could also partition Z* into three cells, one consisting of the positive integers 
divisible by 3, another containing all positive integers leaving a remainder of 1 when di- 
vided by 3, and the last containing positive integers leaving a remainder of 2 when 
divided by 3. 

Generalizing, for each positive integer n, we can partition Z* into n cells according 
to whether the remainder is 0, 1, 2.--- , 2 — 1 when a positive integer is divided by n. 
These cells are the residue classes modulo n in Z*. Exercise 35 asks us to display these 
partitions for the cases n = 2, 3, and 5. A 
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Each partition of a set § yields a relation .# on S in a natural way: namely, for 
x,y €S, let x .# y if and only if x and y are in the same cell of the partition. In set 
notation, we would write x .# y as (x, y) € # (see Definition 0.7). A bit of thought 
shows that this relation .# on S satisfies the three properties of an equivalence relation 
in the following definition. 


An equivalence relation 7 on a set S is one that satisfies these three properties for all 
xXx,yzeS. 


1. (Reflexive) x .# x. 
2. (Symmetric) If x # y, then y #4 x. 
3. (Transitive) If x. y and y.# z then x .# z. a 


To illustrate why the relation .# corresponding to a partition of S' satisfies the 
symmetric condition in the definition, we need only observe that if y is in the same cell 
as x (that is, if x .# 5), then x is in the same cell as y (that is, y .# x). We leave the 
similar observations to verify the reflexive and transitive properties to Exercise 28. 


For any nonempty set S, the equality relation = defined by the subset {(x, x) |« € S} of 
S x S is an equivalence relation. A 


(Congruence Modulon) Letn € Z*. The equivalence relation on ZT corresponding 
to the partition of Z* into residue classes modulo n, discussed in Example 0.17, is 
congruence modulo n. It is sometimes denoted by =,. Rather than write a=,b, we 
usually write a = b (mod n), read, “a is congruent to b modulo n.” For example, we 
have 15 = 27 (mod 4) because both 15 and 27 have remainder 3 when dividedby4. A 


Let a relation .# on the set Z be defined by n .# m if and only if nm > 0, and let us 


determine whether .# is an equivalence relation. 

Reflexive a. a, because a? > 0 for alla € Z. 

Symmetric Ifa), then ab > 0,so ba > Oandb Aa. 

Transitive Ifa.#bandb.4Ac, then ab > Oand bc > 0. Thus ab*c =acb? > 0. 
If we knew b? > 0, we could deduce ac > 0 whence a # c. We have to examine the 
case b = 0 separately. A moment of thought shows that —3 .#% 0 and 0.# 5, but we do 
not have —3 .72 5. Thus the relation .# is not transitive, and hence is not an equivalence 
relation. A 


We observed above that a partition yields a natural equivalence relation. We now 
show that an equivalence relation on a set yields a natural partition of the set. The theorem 
that follows states both results for reference. 


(Equivalence Relations and Partitions) Let S be a nonempty set and let ~ be an 
equivalence relation on S. Then ~ yields a partition of S, where 


a={xeS|x~a}. 


Section 0 Sets and Relations 


Also, each partition of S gives rise to an equivalence relation ~ on $ where a ~ bif and 
only if a and b are in the same cell of the partition. 


Proof We must show that the different cells a = {x € S|x ~ a} fora € S do give a partition 
of S, so that every element of S$ is in some cell and so that if a € b, then G@ = BD. Let 
a € S. Then a € d by the reflexive condition (1), soa is in at least one cell. 
Suppose now that a were in a cell b also. We need to show that @ = D as sets; this 
will show that a cannot be in more than one cell. There is a standard way to show that 
two sets are the same: 


Show that each set is a subset of the other. 


We show thata@ € b.Letx € G.Thenx ~ a. Buta € b, soa ~ b. Then, by the transitive 
condition (3), x ~ b, sox € b. Thus a C b. Now we show that b Ca. Let y € b. Then 
y~ b.Butae b,soa ~ band, by symmetry (2), b ~ a. Then by transitivity 3), y ~ a, 
so y € d. Hence b € & also, so b = G and our proof is complete. o 


Each cell in the partition arising from an equivalence relation is an equivalence 
class. 


@ EXERCISES 0 


In Exercises 1 through 4, describe the set by listing its elements. 

1. {x € R[x? = 3} 2. {m € Z| m? = 3} 

3. {m € Z| mn = 60 for some n € Z} 4. {m €Z|m?> —m < 115} 
In Exercises 5 through 10, decide whether the object described is indeed a set (is well defined). Give an alternate 
description of each set. 

5. {n € Z* |n isa large number} 

6. {n€Z|n° <0} 

7. {n€Z|39 <n? < 57} 

8. {x € Q|x is almost an integer} 

9, {x € Q|x may be written with denominator greater than 100} 

10. {x € Q|x may be written with positive denominator less than 4} 
11. List the elements in {a, b, c} x {1, 2, c}. 


12. Let A = {1, 2, 3} and B = {2, 4, 6}. For each relation between A and B given as a subset of A x B, decide 
whether it is a function mapping A into B. If it is a function, decide whether it is one to one and whether it is 


onto B. 

a. {(1, 4), 2, 4), G, 6)} b. {(1, 4), (2, 6), (3, 4} 
ce. {(1, 6), (1, 2), (1, 4)} d. {(2, 2), (1, 6), 3, 9} 
e. {(1, 6), (2, 6), (3,.6)} f. {1 2), (2, 6), (2, 9} 


13. Illustrate geometrically that two line segments AB and C D of different length have the same number of points 
by indicating in Fig. 0.23 what point y of CD might be paired with point x of AB. 


14. 


15. 
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Recall that fora, b € Randa < b, the closed interval [a, b] in R is defined by [a, b] = {x € Rla < x < 5}. 
Show that the given intervals have the same cardinality by giving a formula for a one-to-one function f mapping 
the first interval onto the second. 


a. [0, 1] and [0, 2] b. [1, 3] and [5, 25] c. [a, b] and [c, d} 
Show that § = {x € R|0 <x < 1} has the same cardinality as R. [Hint: Find an elementary function of 


calculus that maps an interval onc to one onto R, and then translate and scale appropriately to make the domain 
the set $.] 


For any set A, we denote by / (A) the collection of all subsets of A. For example, if A = {a, b,c, d}, then 
{a, b, d} € P(A). The set (A) is the power set of A. Exercises 16 through 19 deal with the notion of the power 
set of aset A. 


16. 


17. 


18. 


19. 


20. 


21. 


22. 


List the elements of the power set of the given set and give the cardinality of the power set. 
a @ b. {a} ec. {a, b} d. {a,b,c} 


Let A be a finite set, and let |A] = s. Based on the preceding exercise, make a conjecture about the value of 
|S (A)|. Then try to prove your conjecture. 


For any set A, finite or infinite, let B“ be the set of all functions mapping A into the set B = {0, 1}. Show that 
the cardinality of B“ is the same as the cardinality of the set (A). [Hint: Each element of B* determines a 
subset of A in a natural way.] 

Show that the power set of a set A, finite or infinite, has too many elements to be able to be put in a one-to-one 
correspondence with A. Explain why this intuitively means that there are an infinite number of infinite cardinal 
numbers. (Hint: Imagine a one-to-one function ¢ mapping A into / (A) to be given. Show that ¢ cannot be 
onto /(A) by considering, for each x € A, whether x € $(x) and using this idea to define a subset S of A that 
is not in the range of ¢.] Is the set of everything a logically acceptable concept? Why or why not? 

Let A = {1, 2} and let B = {3, 4, 5}. 


a. Illustrate, using A and B, why we consider that 2 +3 = 5. Use similar reasoning with sets of your own 
choice to decide what you would consider to be the value of 


i. 3+, fi. No + No. 
b. Illustrate why we consider that 2-3 = 6 by plotting the points of A x B in the plane R x R. Use similar 
reasoning with a figure in the text to decide what you would consider to be the value of Xo - Ro. 


How many numbers in the interval 0 < x < 1 can be expressed in the form .##, where each # is a digit 
0, 1,2, 3,---, 9? How many are there of the form .#4H##?? Following this idea, and Exercise 15, decide what 
you would consider to be the value of 10°. How about 12*° and 2*«? 


Continuing the idea in the preceding exercise and using Exercises 18 and 19, use exponential notation to fill in 
the three blanks to give a list of five cardinal numbers, each of which is greater than the preceding one. 


No. IRI, —, —, —. 
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In Exercises 23 through 27, find the number of different partitions of a set having the given number of elements. 


23. 
26. 
28. 


1 element 24, 2 elements 25. 3 elements 
4 elements 27. 5 elements 
Consider a partition of a set S. The paragraph following Definition 0.18 explained why the relation 


x # y if and only if x and y are in the same cell 


satisfies the symmetric condition for an equivalence relation. Write similar explanations of why the reflexive 
and transitive properties are also satisifed. 


In Exercises 29 through 34, determine whether the given relation is an equivalence relation on the set. Describe the 
partition arising from each equivalence relation. 


29. 
31. 
33. 
34. 
35. 


36. 


37. 


n.ZeminZifnm > 0 30. x. #4yinRifx>y 
x # yinR if |x| = ly! 32. x zy inRif |x — y| <3 


n.&m inZ* ifn and m have the same number of digits in the usual base ten notation 
n.&m in Z* ifn and m have the same final digit in the usual base ten notation 


Using set notation of the form {#, #, #, - - -} for an infinite set, write the residue classes modulo n in Z* discussed 
in Example 0.17 for the indicated value of n. 


an=2 bn=3 en=5 


Let n € Zt and let ~ be defined on Z by r ~ s if and only if r — s is divisible by n, that is, if and only if 
r—s =ng forsomeg € Z. 


a. Show that ~ is an equivalence relation on Z. (It is called “congruence modulo n” just as it was for Z*. See 
part b.) 

b. Show that, when restricted to the subset Z* of Z, this ~ is the equivalence relation, congruence modulo n, 
of Example 0.20. 

c. The cells of this partition of Z are residue classes modulo n in Z. Repeat Exercise 35 for the residue classes 
modulo in Z rather than in Z* using the notation {.--, #,#, #.-- -} for these infinite sets. 


Students often misunderstand the concept of a one-to-one function (mapping). I think I know the reason. You 
see, a mapping @ : A > B has a direction associated with it, from A to B. It seems reasonable to expect a 
one-to-one mapping simply to be a mapping that carrics one point of A into one point of B, in the direction 
indicated by the arrow. But of course, every mapping of A into B does this, and Definition 0.12 did not say 
that at all. With this unfortunate situation in mind, make as good a pedagogical case as you can for calling the 
functions described in Definition 0.12 two-to-two functions instead. (Unfortunately, it is almost impossible to 
get widely used terminology changed.) 


Groups and Subgroups 


Section 1 Introduction and Examples 

Section 2. Binary Operations 

Section 3 |somorphic Binary Structures 

Section 4 Groups 

Section 5 Subgroups 

Section 6 = Cyclic Groups 

Section 7 Generating Sets and Cayley Digraphs 


INTRODUCTION AND EXAMPLES 


In this section, we attempt to give you a little idea of the nature of abstract algebra. 
We are all familiar with addition and multiplication of real numbers. Both addition 
and multiplication combine two numbers to obtain one number. For example, addition 
combines 2 and 3 to obtain 5. We consider addition and multiplication to be binary 
operations. In this text, we abstract this notion, and examine sets in which we have one 
or more binary operations. We think of a binary operation on a set as giving an algebra 
on the set, and we are interested in the structural properties of that algebra. To illustrate 
what we mean by a structural property with our familiar set R of real numbers, note 
that the equation x + x =a has a solution x in R for each a € R, namely, x = a/2. 
However, the corresponding multiplicative equation x -x = a does not have a solution 
in R if a < 0. Thus, R with addition has a different algebraic structure than R with 
multiplication. 

Sometimes two different sets with what we naturally regard as very different binary 
operations turn out to have the same algebraic structure. For example, we will see in 
Section 3 that the set R with addition has the same algebraic structure as the set R* of 
positive real numbers with multiplication! 

This section is designed to get you thinking about such things informally. We will 
make everything precise in Sections 2 and 3. We now turn to some examples. Multipli- 
cation of complex numbers of magnitude 1 provides us with several examples that will 
be useful and illuminating in our work. We start with a review of complex numbers and 
their multiplication. 
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Complex Numbers 


A real number can be visualized geometrically as a point on a line that we often regard 
as an x-axis. A complex number can be regarded as a point in the Euclidean plane, as 
shown in Fig. 1.1. Note that we label the vertical axis as the yi -axis rather than just the 
y-axis, and label the point one unit above the origin with i rather than 1. The point with 
Cartesian coordinates (a, b) is labeled a + bi in Fig. 1.1. The set C of complex numbers 
is defined by 


C= {at+bila,b € BR}. 


We consider R to be a subset of the complex numbers by identifying a real number r 
with the complex number r + 0i. For example, we write 3 + Oi as 3 and —1 + Oi as —w 
and 0 + 07 as 0. Similarly, we write 0+ li asi and 0+ si as si. 

Complex numbers were developed after the development of real numbers. The 
complex number i was invented to provide a solution to the quadratic equation x? = —1, 
so we require that 


i? = -1. (1) 


Unfortunately, i has been called an imaginary number, and this terminology has led 
generations of students to view the complex numbers with more skepticism than the real 
numbers. Actually, a// numbers, such as 1, 3, 7, —./3, andi are inventions of our minds. 
There is no physical entity that is the number 1. If there were, it would surely be in a 
place of honor in some great scientific museum, and past it would file a steady stream of 
mathematicians, gazing at 1 in wonder and awe. A basic goal of this text is to show how 
we can invent solutions of polynomial equations when the coefficients of the polynomial 
may not even be real numbers! 


Multiplication of Complex Numbers 


The product (a + bi)(c + di) is defined in the way it must be if we are to enjoy the 
familiar properties of real arithmetic and require that i? = —1, in accord with Eq. (1). 


1.2 Example 


Solution 
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Namely, we see that we want to have 
(a + bi)(e + di) = ac + adi + bei + bdi? 
=ac+adi+bci + bd(—1) 
= (ac — bd) + (ad + be)i. 


Consequently, we define multiplication of z] = a + bc and z2 =c + di as 


2122 = (a+ bi)(c + di) = (ac — bd) + (ad + be)i, (2) 


which is of the form r + si with r =ac — bd and s = ad + bc. It is routine to check 
that the usual properties z1Z2 = 2221. 21(Z273) = (Z122)z3 and z1(Z2 + 23) = 2122 + 2123 
all hold for all z;. 22, z3 € C. 


Compute (2 — 5i)(8 + 37). 
We don’t memorize Eq. (2), but rather we compute the product as we did to motivate 
that equation. We have 

(2 — 5i)(8 + 31) = 164 61 — 407 + 15 = 31 — 34). A 


To establish the geometric meaning of complex multiplication, we first define the abso- 
lute value |a + bi| of a+ bi by 


la+ bi] = fa? 4+ b?. (3) 


This absolute value is a nonnegative real number and is the distance from a + bi to the 
origin in Fig. 1.1. We can now describe a complex number z in the polar-coordinate form 


z = |z|(cos@ +i sin@), (4) 


where @ is the angle measured counterclockwise from the x-axis to the vector from 0 to 
z, as shown in Fig. 1.3. A famous formula due to Leonard Euler states that 


e? = cosO +i sing. 


Euler’s Formula 


We ask you to derive Euler’s formula formally from the power series expansions for 
e®.cos@ and sin @ in Exercise 41. Using this formula, we can express z in Eq. (4) as 


z= [z\(cos @+ isin 6) 


I 
I 
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I 
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z = |zle’’. Let us set 


10 12 


z1 = |z11e and Z2 = |z2\e 


and compute their product in this form, assuming that the usual laws of exponentiation 
hold with complex number exponents. We obtain 


zzz = lerle™ |zale® = lzallzale™ 


= |zi||z2|[cos(@1 + 42) + i sin(@) + 42)]. (5) 


Note that Eq. 5 concludes in the polar form of Eq. 4 where |z1z2| = |z1||z2| and the 
polar angle 6 for z;z2 is the sum @ = 6, + 62. Thus, geometrically, we multiply complex 
numbers by multiplying their absolute values and adding their polar angles, as shown 
in Fig. 1.4. Exercise 39 indicates how this can be derived via trigonometric identities 
without recourse to Euler’s formula and assumptions about complex exponentiation. 


oe l \7?2 | 


0 1 


> X 


1.4 Figure 1.5 Figure 


Note that i has polar angle 7/2 and absolute value 1, as shown in Fig. 1.5. Thus 7 
has polar angle 2(7/2) = m and |] - 1] = 1, so that i? = —1. 


Find all solutions in C of the equation z* =i. 


Writing the equation z2 = i in polar form and using Eq. (5), we obtain 
iz/*(cos 26 +i sin20) = 100 + i). 


Thus |z|? = 1, so |z| = 1. The angle 6 for z must satisfy cos 20 = 0 and sin20 = 1. 
Consequently, 26 = (7/2) + n(2z), so @ = (a /4) + nx for an integer n. The values of 
n yielding values @ where 0 < 6 < 27 are 0 and 1, yielding 6 = 7/4 or @ = 52/4. Our 
solutions are 


1 Pa i si i and 1 eee ate a 
= c t in ni 23. Os — sin —— 
Zl 4 FI : 22 a i st a 


or 


1 1 
zeit and zy = (1+). A 
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Find all solutions of z* = —16. 


As in Example 1.6 we write the equation in polar form, obtaining 
Iz\*(cos 40 +i sin49) = 16(—1 + Oi). 


Consequently, |z|* = 16, so |z| = 2 while cos 46 = —1 and sin4@ = 0. We find that 
49 =x +n(27), so 0 = (7/4) + n(1/2) for integers n. The different values of 0 ob- 
tained where 0 < @ < 2m are 1/4, 3m/4, 57/4, and 7/4. Thus one solution of z4 = 
—16is 


2( cos 7 isin? ) == f =i) 9/01 84), 


In a similar way, we find three more solutions, 
/2(-1+), v2(-1-), and V20-3. A 


The last two examples illustrate that we can find solutions of an equation z” = a + bi 
by writing the equation in polar form. There will always be n solutions, provided that 
a+ bi #0. Exercises 16 through 21 ask you to solve equations of this type. 

We will not use addition or division of complex numbers, but we probably should 
mention that addition is given by 


(a+ bi)+(c+di)=(at+e)+(b+a)i. (6) 


and division of a + bi by nonzero c + di can be performed using the device 


c+di c+di c—di_ ct d 
act+tbd bc-ad. 

5 aos sl. 

e+da C+a 


at bi at+bi e—di  (ac+bd)+ (be — ad)i 


(7) 


Algebra on Circles 


Let U = {z € C | |z| = 1}, so that U is the circle in the Euclidean plane with center at 
the origin and radius 1, as shown in Fig. 1.8. The relation |zjz2| = !z1|!z2| shows that 
the product of two numbers in U is again a number in U; we say that U is closed under 
multiplication. Thus, we can view multiplication in U as providing algebra on the circle 
in Fig. 1.8. 

As illustrated in Fig. 1.8, we associate with each z= cos@ +isiné@ in U a real 
number @ € R that lies in the half-open interval where 0 < @ < 27. This half-open 
interval is usually denoted by [0, 27), but we prefer to denote it by Ra, for reasons 
that will be apparent later. Recall that the angle associated with the product z)z2 of two 
complex numbers is the sum 6, + 02 of the associated angles. Of course if 6, + 02 > 27 
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then the angle in Ro, associated with z122 is 0; + 6) — 27. This gives us an addition 
modulo 27 on R2,,. We denote this addition here by +27. 


In Rox, we have 2% 42, % = YF -2n =F. A 


There was nothing special about the number 27 that enabled us to define addition on 
the half-open interval R2,. We can use any half-open interval R, = {x e RJIO<x < c}. 


In Ro3, we have 16 +23 19 = 35 — 23 = 12. In Rg 5, wehave6 +35 8 = 14-85 =5.5. 
A 


Now complex number multiplication on the circle U where {z| = 1 and addition 
modulo 27 on Ro, have the same algebraic properties. We have the natural one-to-one 
correspondence z <> 6 between z € U and @ € Roz indicated in Fig. 1.8. Moreover, we 
deliberately defined +2, so that 


if 2<o6 and 26, then 7-22 <> () ton 4). (8) 


isomorphism 


The relation (8) shows that if we rename each z € U by its corresponding angle 7] 
shown in Fig. 1.8, then the product of two elements in U is renamed by the sum of the 
angles for those two elements. Thus U with complex number multiplication and Roz 
with addition modulo 27 must have the same algebraic properties. They differ only in the 
names of the elements and the names of the operations. Such a one-to-one correspondence 
satisfying the relation (8) is called an isomorphism. Names of elements and names of 
binary operations are not important in abstract algebra; we are interested in algebraic 


1.11 Example 


1.12 Example 


1.13 Example 
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properties. We illustrate what we mean by saying that the algebraic properties of U and 
of R;, are the same. 


In U there is exactly one element e such that e- z = z for all z € U, namely, e = 1. 
The element 0 in Ro, that corresponds to 1 € U is the only element e¢ in Rp, such that 
é€ +2, x =x forall x € Ro. A 


The equation z-z-z-z = 1in U has exactly four solutions, namely, 1,7, —1, and —/. 
Now 1 € U andO € R>, correspond, and the equation x +2, x ta * +o, x = Oin Ro, 
has exactly four solutions, namely, 0, 7/2, 7, and 37/2, which, of course, correspond 
to 1, i, —1, and —1, respectively. A 


Because our circle U has radius 1, it has circumference 27 and the radian measure of 
an angle @ is equal to the length of the arc the angle subtends. If we pick up our half-open 
interval IR2,, put the 0 in the interval down on the 1 on the x-axis and wind it around the 
circle U counterclockwise, it will reach all the way back to 1. Moreover, each number 
in the interval will fall on the point of the circle having that number as the value of the 
central angle @ shown in Fig. 1.8. This shows that we could also think of addition on 
Ro, as being computed by adding lengths of subtended arcs counterclockwise, starting 
at z = 1, and subtracting 27 if the sum of the lengths is 27 or greater. 

If we think of addition on a circle in terms of adding lengths of arcs from a starting 
point P on the circle and proceeding counterclockwise, we can use a circle of radius 
2, which has circumference 47r, just as well as a circle of radius 1. We can take our 
half-open interval [R4, and wrap it around counterclockwise, starting at P; it will just 
cover the whole circle. Addition of arcs lengths gives us a notion of algebra for points on 
this circle of radius 2, which is surely isomorphic to Ry, with addition +4,. However, 
if we take as the circle |z| = 2 in Fig. 1.8, multiplication of complex numbers does not 
give us an algebra on this circle. The relation |z,z2| = |Z1|}z2| shows that the product of 
two such complex numbers has absolute value 4 rather than 2. Thus complex number 
multiplication is not closed on this circle. x 

The preceding paragraphs indicate that a little geometry can sometimes be of help 
in abstract algebra. We can use geometry to convince ourselves that IR2, and Ra, are 
isomorphic. Simply stretch out the interval R2, uniformly to cover the interval R4,,, or, 
if you prefer, use a magnifier of power 2. Thus we set up the one-to-one correspondence 
a <> 2a between a € Ro, and 2a € Ra,. The relation (8) for isomorphism becomes 


if a<2a and b<=2b then (a+), b) © Qa ta, 2b). (9) 
isomorphism 


This is obvious ifa +6 < 27. Ifa+b=2n +c, then 2a + 2b = 4m + 2c, and the 
final pairing in the displayed relation becomes c <> 2c, which is true. 


Xx +4, X +47 ¥ +4z x = 0 in Ra, has exactly four solutions, namely, 0, z, 27, and 37, 
which are two times the solutions found for the analogous equation in IR2, in Exam- 
ple 1.12. A 


18 


Part I 


1,14 Example 


1.15 Example 


Groups and Subgroups 


There is nothing special about the numbers 27 and 47 in the previous argument. 
Surely, R, with +, is isomorphic to Ry with +, for all c,d € R™. We need only pair 
x € R, with (d/c)x € Ry. 


Roots of Unity 


The elements of the set U, = {z € C|z” = 1} are called the n” roots of unity. Using the 
technique of Examples 1.6 and 1.7, we see that the elements of this set are the numbers 


2. 2 
cos (m2) +i sin (m=) for m=0.1,2,-:-,n-1. 
fi nh 


They all have absolute value 1, so U, C U. Tf we let ¢ = cos = +isin or, then these 
n® roots of unity can be written as 


ea ete ise (10) 


Because ¢” = 1, these n powers of ¢ are closed under multiplication. For example, with 
n = 10, we have 


cor8 ac a pt oct aet a 


Thus we sec that we can compute ¢'¢/ by computing i +, j, viewing i and j as elements 
of Ry. 

Let Z, = (0, 1,2,3,---, — 1}. We see that Z, C R, and clearly addition modulo 
n is closed on Zy. 


The solution of the equation x +5 = 3 in Zg isx = 6, because 5+g6= 11-8 =3. 
A 


If we rename each of the n™ roots of unity in (10) by its exponent, we use for names 
all the elements of Z,,. This gives a one-to-one correspondence between U, and Zp. 
Clearly, 


if cioi and cio j, then (¢'-¢/)< (itn). (41) 


isomorphism 


Thus U,, with complex number multiplication and Z, with addition +, have the same 
algebraic properties. 


It can be shown that there is an isomorphism of Us with Zg in which ¢ =e"? <= 5. 


Under this isomorphism, we must then have =o 6 eo 5495 =2. A 


Exercise 35 asks you to continue the computation in Example 1.15, finding the 
elements of Zg to which each of the remaining six elements of Ug correspond. 
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@ EXERCISES 1 


In Exercises 1 through 9 compute the given arithmetic expression and give the answer in the form a + bi for 
a,beR. 


1. 2 2. i4 3. i? 

4, (-i)® 5. (4—1)(5 + 37) 6. (8 + 21)B —i) 

7, (2—31)(4+1) + (6 — Si) 8. +i)? 9, (1 —1)° (Use the binomial theorem.) 
10. Find |3 — 4i|. 11. Find |6 + 4i|. 
In Exercises 12 through 15 write the given complex number z in the polar form |z|(p + gi) where |p + qi| = 1. 
12. 3-4 13. —1+i 14. 12+5i 15. —3 +57 
In Exercises 16 through 21, find all solutions in C of the given equation. 
16. 24 =1 17. 7=-1 18. 2 =—8 19, 2? = —27i 
20:.20-=1 21, 2° = —64 
In Exercises 22 through 27, compute the given expression using the indicated modular addition. 
22, 10 +47 16 23. 8 +106 24, 20.5 +25 19.3 
25, 44,2 26. Zt, % 27. 272 + qq 32 


28. Explain why the expression 5 +. 8 in Rg makes no sense. 


In Exercises 29 through 34, find all solutions x of the given equation. 


29. x +45 7 = 3 in Zj5 30. x +27 2 = 22 in Rox 
31. x+7x =3inZ 32. xt7x+7x =5inZ, 
33. xtpx =2in Zp 34. x+4xtax+4x =Oin Zy 


35. Example 1.15 asserts that there is an isomorphism of Ug with Zs in which ¢ = e'*/” < 5 and ¢? © 2. Find 
the clement of Zg that corresponds to each of the remaining six elements ¢” in Ug for m = 0,3, 4,5, 6, 
and 7. 


36. There is an isomorphism of U3 with Z; in which ¢ = e’°*/” <> 4, Find the element in Z7 to which ¢” must 
correspond for m = 0, 2, 3, 4, 5, and 6. 


37, Why can there be no isomorphism of Us with Z in which ¢ = e'*/>) corresponds to 4? 
38. Derive the formulas 
sin(a + b) = sinacosb+cosasinb 
and 


cos(a + b) = cosacosb — sinasinb 
by using Euler’s formula and computing ee’. 
39, Let z; = |z;|(cos @, +7 sin @)) and z2 = |Z2|(cos @) +7 sin@). Use the trigonometric identities in Exercise 38 
to derive z}Z2 = |z:||z2|{cos(0; + 62) + 7 sin(@; + 42)]. 
40. a. Derive a formula for cos 39 in terms of sin 6 and cos @ using Euler’s formula. 
b. Derive the formula cos 39 = 4cos? 6 — 3 cos @ from part (a) and the identity sin? 6 + cos? 6 = 1. (We will 
have use for this identity in Section 32.) 
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41. Recall the power series expansions 


x 1 ; x f; x x 
e =1+x¢4 TT T 31 + AI + + nl + ; 
x? x 7 est 2n-1 
sinx =x 31 + 517 ate +(-1) @ Di , and 
x? x x6 2n 
osx =1- ao ay ai 
oe oa eS Ge 


from calculus. Derive Euler’s formula e’® = cos@ + isin 6 formally from these three series expansions. 


2.1 Definition 


BINARY OPERATIONS 


Suppose that we are visitors to a strange civilization in a strange world and are observing 
one of the creatures of this world drilling a class of fellow creatures in addition of 
numbers. Suppose also that we have not been told that the class is learning to add, but 
were just placed as observers in the room where this was going on. We are asked to give a 
report on exactly what happens. The teacher makes noises that sound to us approximately 
like gloop, poyt. The class responds with bimr. The teacher then gives ompt, gaft, and the 
class responds with poyt. What are they doing? We cannot report that they are adding 
numbers, for we do not even know that the sounds are representing numbers. Of course, 
we do realize that there is communication going on. All we can say with any certainty is 
that these creatures know some rule, so that when certain pairs of things are designated 
in their language, one after another, like gloop, poyt, they are able to agree on a response, 
bimt. This same procedure goes on in addition drill in our first grade classes where a 
teacher may say four, seven, and the class responds with eleven. 

In our attempt to analyze addition and multiplication of numbers, we are thus led to 
the idea that addition is basically just a rule that people learn, enabling them to associate, 
with two numbers in a given order, some number as the answer. Multiplication is also 
such a tule, but a different rule. Note finally that in playing this game with students, 
teachers have to be a little careful of what two things they give to the class. If a first 
grade teacher suddenly inserts zen, sky, the class will be very confused. The rule is only 
defined for pairs of things from some specified set. 


Definitions and Examples 


As mathematicians, let us attempt to collect the core of these basic ideas in a useful 
definition, generalizing the notions of addition and multiplication of numbers. As we 
remarked in Section 0, we do not attempt to define a set. However, we can attempt to 
be somewhat mathematically precise, and we describe our generalizations as functions 
(see Definition 0.10 and Example 0.11) rather than as rules. Recall from Definition 0.4 
that for any set S, the set S x S consists of all ordered pairs (a, b) for elements a and b 
of S. 


A binary operation * ona set S$ is a function mapping S x S into S. For each (a, b) € 
S x S, we will denote the element *((a, b)) of S by a * b. i] 


2.2 Example 


2.3 Example 


2.4 Definition 


2.5 Example 


2.6 Example 
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Intuitively, we may regard a binary operation « on S as assigning, to each ordered 
pair (a, b) of elements of S, an element a x b of S. We proceed with examples. 


Our usual addition + is a binary operation on the set R. Our usual multiplication - is a 
different binary operation on R. In this example, we could replace R by any of the sets 
C, Z, Rt, or Zt. A 


Note that we require a binary operation on a set S to be defined for every ordered 
pair (a, b) of elements from S. 


Let M(R) be the set of all matrices’ with real entries. The usual matrix addition + is not 
a binary operation on this set since A + B is not defined for an ordered pair (A, B) of 
matrices having different numbers of rows or of columns. A 


Sometimes a binary operation on S provides a binary operation on a subset H of S 
also. We make a formal definition. a 


Let « be a binary operation on S$ and let H be a subset of S. The subset H is closed 
under x if for alla, b € H we also havea x b € H. In this case, the binary operation on 
H given by restricting « to H is the induced operation of * on H. | 


By our very definition of a binary operation * on S, the set § is closed under *, but 
a subset may not be, as the following example shows. 


Our usual addition + on the set R of real numbers does not induce a binary operation 
on the set R* of nonzero real numbers because 2 € R* and —2 € R*, but 2+ (—2) =0 
and 0 ¢ R*. Thus R* is not closed under x. A 


In our text, we will often have occasion to decide whether a subset H of S is closed 
under a binary operation « on S. To arrive at a correct conclusion, we have to know what 
it means for an element to be in H, and to use this fact. Students have trouble here. Be 
sure you understand the next example. 


Let + and - be the usual binary operations of addition and multiplication on the set 
Z, and let H = {n?{n € Z+}. Determine whether H is closed under (a) addition and 
(b) multiplication. 

For part (a), we need only observe that 1? = land2? = 4areinH, but that] + 4 =5 
and 5 ¢ H. Thus # is not closed under addition. 

For part (b), suppose that r ¢ H ands € H. Using what it means for r and s to be 
in H, we see that there must be integers n and m in Z* such that r = n? ands = m*. 
Consequently, rs = n?m* = (nm)’. By the characterization of elements in H and the 


fact that xm € ZT, this means that rs € H, so H is closed under multiplication. A 


* Most students of abstract algebra have studied linear algebra and are familiar with matrices and matrix 
operations. For the benefit of those students, examples involving matrices are often given. The reader who is 
not familiar with matrices can either skip all references to them or turn to the Appendix at the back of the text, 
where there is a short summary. 
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2.7 Example Let F be the set of all real-valued functions f having as domain the set R of real numbers. 
We are familiar from calculus with the binary operations +, —, -, and o on F. Namely, 
for each ordered pair (f, g) of functions in F, we define for each x € R 


figby(f+a)@) = fi) + ex) addition, 
f—ge by (f — gx) = f(x) — g(x) _ subtraction, 
f-gby (fg) = fg) multiplication, 


and 
fog by (fogx) = f(s) composition. 


All four of these functions are again real valued with domain R, so F is closed under all 
four operations +, —, -, and o. A 


The binary operations described in the examples above are very familiar to you. 
In this text, we want to abstract basic structural concepts from our familiar algebra. 
To emphasize this concept of abstraction from the familiar, we should illustrate these 
structural concepts with unfamiliar examples. We presented the binary operations of 
complex number multiplication on U and U,, addition +, on Zn, and addition +, on R, 
in Section I. 

The most important method of describing a particular binary operation * on a given 
set is to characterize the element a * b assigned to each pair (a, b) by some property 
defined in terms of a and b. 


2.8 Example On Zt, we define a binary operation * by a * b equals the smaller of a and b, or the 
common value if a = b. Thus 2 * 11 = 2;15 * 10 = 10; and 3 *3 =3, A 


2.9 Example On Z*, we define a binary operation *’ by a *’ b =a. Thus 2 * 3 = 2,25’ 10 = 25, 
and5 *’5=5. A 


2.10 Example On Zt, we define a binary operation *” by a *” b = (a * b) + 2, where » is defined in 
Example 2.8. Thus 4 *” 7 = 6;25 *” 9 = 11; and6*"6 = 8. A 


It may seem that these examples are of no importance, but consider for a moment. 
Suppose we go into a store to buy a large, delicious chocolate bar. Suppose we see two 
identical bars side by side, the wrapper of one stamped $1.67 and the wrapper of the 
other stamped $1.79. Of course we pick up the one stamped $1.67. Our knowledge of 
which one we want depends on the fact that at some time we learned the binary operation 
* of Example 2.8. It is a very important operation. Likewise, the binary operation *' of 
Example 2.9 is defined using our ability to distinguish order. Think what a problem we 
would have if we tried to put on our shoes first, and then our socks! Thus we should 
not be hasty about dismissing some binary operation as being of little significance. Of 
course, our usual operations of addition and multiplication of numbers have a practical 
importance well known to us. 

Examples 2.8 and 2.9 were chosen to demonstrate that a binary operation may or 
may not depend on the order of the given pair. Thus in Example 2.8, a x b = b xa for 
alla, b € Z, and in Example 2.9 this is not the case, for 5 *’ 7 = 5 but 7 * 5=—7. 


2.11 Definition 


2.12 Definition 


2.13 Theorem 


Proof 
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A binary operation * on a set S$ is commutative if (and only if) a * b = b xa for all 
a,beSs, | 


As was pointed out in Section 0, it is customary in mathematics to omit the words 
and only if from a definition. Definitions are always understood to be if and only if 
statements. Theorems are not always if and only if statements, and no such convention 
is ever used for theorems. 

Now suppose we wish to consider an expression of the form a * b * c. A binary 
operation * enables us to combine only two elements, and here we have three. The obvious 
attempts to combine the three elements are to form either (a * b) * c or a * (b * c). With 
* defined as in Example 2.8, (2 * 5) * 9 is computed by 2 * 5 = 2 and then 2* 9 = 2, 
Likewise, 2 « (5 * 9) is computed by 5 « 9 = 5 and then 2 * 5 = 2. Hence (2 * 5) *9 = 
2 * (5 * 9), and it is not hard to see that for this +, 


(axe b)*c=ax(bxc), 

so there is no ambiguity in writing a * b + c. But for *” of Example 2.10, 

2%" 5) %"9 = 4x" 9 = 6, 
while 

2%" (S49 =24"7=4, 
Thus (a *” b) x” c need not equal a x” (b x” c), and an expression a *” b *” c may be 
ambiguous. 
A binary operation on a set S is associative if (a * b) *c =a *(b*c)foralla,b,c €S. 

a 


It can be shown that if * is associative, then longer expressions such as a * b x 
c*d are not ambiguous. Parentheses may be inserted in any fashion for purposes of 
computation; the final results of two such computations will be the same. 

Composition of functions mapping R into R was reviewed in Example 2.7. For any 
set S and any functions f and g mapping S' into S, we similarly define the composition 
J og ofg followed by f as the function mapping S into S$ such that (f o g)(x) = f(g(x)) 
for all x € S. Some of the most important binary operations we consider are defined 
using composition of functions. It is important to know that this composition is always 
associative whenever it is defined. 


(Associativity of Composition) Let S be aset and let f, g, and A be functions mapping 
S into §. Then f o(goh)=(f ogjoh. 


To show these two functions are equal, we must show that they give the same assignment 
to each x € S. Computing we find that 


(f o(g oh))(x) = f(g oh) = f(gA@)) 
and 
(fo gsoh)@) =f o gh) = f(gA@)), 
so the same element f(g(h(x))) of S is indeed obtained. Sd 
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2.14 Example 


2.15 Table 


2.16 Example 


Solution 
2.17 Table 


a|bl|lc|d 

a d\iaja 

bid c|b 
[ 

clale b 
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As an example of using Theorem 2.13 to save work, recall that it 1s a fairly painful 
exercise in summation notation to show that multiplication of n x n matrices is an 
associative binary operation. If, in a linear algebra course, we first show that there 
is a one-to-one correspondence between matrices and linear transformations and that 
multiplication of matrices corresponds to the composition of the linear transformations 
(functions), we obtain this associativity at once from Theorem 2.13. 


Tables 


For a finite set, a binary operation on the set can be defined by means of a table in which 
the elements of the set are listed across the top as heads of columns and at the left side 
as heads of rows. We always require that the elements of the set be listed as heads across 
the top in the same order as heads down the left side. The next example illustrates the 
use of a table to define a binary operation. 


Table 2.15 defines the binary operation * on S = {a, b, c} by the following rule: 


(ith entry on the left) * (jth entry on the top) 
= (entry in the ith row and jth column of the table body). 


Thusa*b =candb*a =a, so * is not commutative. A 


We can easily see that a binary operation defined by a table is commutative if and 
only if the entries in the table are symmetric with respect to the diagonal that starts at 
the upper left corner of the table and terminates at the lower right corner. 


Complete Table 2.17 so that * is a commutative binary operation on the set S= 
{a, b,c, d}. 


From Table 2.17, we see that b x a = d. For « to be commutative, we must havea *x b= 
d also. Thus we place d in the appropriate square defining a * b, which is located 
symmetrically across the diagonal in Table 2.18 from the square defining b «a. We 
obtain the rest of Table 2.18 in this fashion to give our solution. A 


Some Words of Warning 


Classroom experience shows the chaos that may result if a student is given a set and 
asked to define some binary operation on it. Remember that in an attempt to define a 
binary operation * on a set S we must be sure that 


1. exactly one element is assigned to each possible ordered pair of elements of S, 


2. for each ordered pair of elements of S, the element assigned to it is again in S. 


Regarding Condition 1, a student will often make an attempt that assigns an element 
of S to “most” ordered pairs, but for a few pairs, determines no element. In this event, 
* is not everywhere defined on S. It may also happen that for some pairs, the at- 


d|laj;\bib ~ tempt could assign any of several elements of S, that is, there is ambiguity. In any case 


2.19 Example 


2.20 Example 


2.21 Example 


2.22 Example 


2.23 Example 


2.24 Example 


2.25 Example 
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of ambiguity, + is not well defined. If Condition 2 is violated, then S is not closed 
under x. 

Following are several illustrations of attempts to define binary operations on sets. 
Some of them are worthless. The symbol * is used for the attempted operation in all 
these examples. 


On Q, let a * b = a/b. Here x is not everywhere defined on Q, for no rational number is 
assigned by this rule to the pair (2, 0). A 


On Q”, let a * b = a/b. Here both Conditions | and 2 are satisfied, and x is a binary 
operation on Q*. A 


On Zt, let a * b = a/b. Here Condition 2 fails, for 1 x 3 is not in Z*. Thus x is not a 
binary operation on Z*, since Zt is not closed under x. A 


Let F be the set of all real-valued functions with domain R as in Example 2.7. Suppose 
we “define” * to give the usual quotient of f by g, that is, f * g =h, where h(x) = 
f(x)/g(x). Here Condition 2 is violated, for the functions in F were to be defined for 
all real numbers, and for some g € F, g(x) will be zero for some values of x in R and 
h(x) would not be defined at those numbers in R. For example, if f(x) = cosx and 
g(x) = x’, then h(O) is undefined, soh ¢ F. A 


Let F be as in Example 2.22 and let f * g =h, where h is the function greater than 
both f and g. This “definition” is completely worthless. In the first place, we have not 
defined what it means for one function to be greater than another. Even if we had, any 
sensible definition would result in there being many functions greater than both f and 
g, and x would still be not well defined. A 


Let S be a set consisting of 20 people, no two of whom are of the same height. Define 
« by a * b = c, where c is the tallest person among the 20 in S. This is a perfectly good 
binary operation on the set, although not a particularly interesting one. A 


Let S be as in Example 2.24 and let a * b = c, where c is the shortest person in S who 
is taller than both a and b. This x is not everywhere defined, since if either a or b is the 
tallest person in the set, a * b is not determined. A 


@ EXERCISES 2 


Computations 


Exercises | through 4 concern the binary operation « defined on S = {a, b, c, d, e} by means of Table 2.26. 


1. Compute b «d, cc, and [(a * c) * e] «a. 


2. Compute (a * b) * c and a * (b x c). Can you say on the basis of this computations whether * is associative? 


3. Compute (b « d) * c and b * (d * c). Can you say on the basis of this computation whether » is associative? 
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2.26 Table 2.27 Table 2.28 Table 


4, Is * commutative? Why? 
5. Complete Table 2.27 so as to define a commutative binary operation + on S = {a, b,c, d}. 
6. Table 2.28 can be completed to define an associative binary operation * on S = fa, b,c, d}. Assume this is 
possible and compute the missing entries. 
In Exercises 7 through 11, determine whether the binary operation * defined is commutative and whether + is 
associative. 
7, * defined on Z by letting a* b =a —b 
8. x defined on Q by lettinga*b=ab+1 
9, x defined on Q by letting a ¥ b = ab/2 
10. * defined on Z* by letting a * b = 2° 
11. « defined on Z* by lettinga +b =a? 


12. Let S be a set having exactly one element. How many different binary operations can be defined on S? Answer 
the question if S has exactly 2 elements; exactly 3 elements, exactly n elements. 


13. How many different commutative binary operations can be defined on a set of 2 elements? on a set of 3 
elements? on a set of n elements? 


Concepts 


In Exercises 14 through 16, correct the definition of the italicized term without reference to the text, if correction 
is needed, so that it is in a form acceptable for publication. 


14. A binary operation * is commutative if and only ifa*b = b*a. 


15. A binary operation * on a set S is associative if and only if, for all a,b,c eS, we have 
(b*xc)*xa =bx*(c*a). 


16. A subset H of aset S is closed under a binary operation * on S if and only if (a * b) € H foralla,be S. 
In Exercises 17 through 22, determine whether the definition of * does give a binary operation on the set. In the 


event that « is not a binary operation, state whether Condition 1, Condition 2, or both of these conditions on page 24 
are violated. 


17. On Z", define x by letting a *b =a — b. 

18. On Z*, define x by letting a * b =a’. 

19. On R, define x by letting a*b =a —b. 

20. On Zt, define * by letting a * b = c, where c is the smallest integer greater than both a and b. 
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21. On Zt, define * by letting a x b = c, where c is at least 5 more than a + b. 


22. On Z", define « by letting a x b = c, where c is the largest integer less than the product of a and b. 
a 


23. Let H be the subset of M>(IR) consisting of all matrices of the form [ b 


a matrix addition? b matrix multiplication? 


| for a, b € R. Is H closed under 


24. Mark each of the following true or false. 


a. If * is any binary operation on any set S, thena «a =a forallae S. 


b. If * is any commutative binary operation on any set S, then a *(b *c) = (b *c) «a for all a, b, 
ces. 


c. If * is any associative binary operation on any set S, thena * (b xc) = (b« c) x a foralla, b,c € S. 


d. The only binary operations of any importance are those defined on sets of numbers. 


e, A binary operation * on a set S is commutative if there exist a, b € S such thata+b =bxa. 


f. Every binary operation defined on a set having exactly one element is both commutative and 
associative. 


g. A binary operation on a set S assigns at least one element of S to each ordered pair of elements 
of S. 


h. A binary operation ona set S assigns at most one element of 5 to each ordered pair of elements of 
S. 

i. A binary operation on a set S assigns exactly one element of S to each ordered pair of elements 
of S. 


j. A binary operation on a set S may assign more than one element of S to some ordered pair of 
elements of S. 


25. Give a set different from any of those described in the examples of the text and not a set of numbers. Define 
two different binary operations * and x’ on this set. Be sure that your set is well defined. 


Theory 


26. Prove that if x is an associative and commutative binary operation on a set S, then 
(ax b)*(c*xd) = [(d *c) xa] *b 
for all a, b, c,d € S. Assume the associative law only for triples as in the definition, that is, assume only 
(xxy)*ez=x«(y *Z) 
forallx,y,ze€S. 


In Exercises 27 and 28, either prove the statement or give a counterexample. 
27. Every binary operation on a set consisting of a single element in both commutative and associative. 


28. Every commutative binary operation on a set having just two elements is associative. 


Let F be the set of all real-valued functions having as domain the set R of all real numbers. Example 2.7 defined 
the binary operations +, —, -, and o on F. In Exercises 29 through 35, either prove the given statement or give a 
counterexample. 


29. Function addition + on F is associative. 


30. Function subtraction — on F is commutative 
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31. 
32. 
33. 
34. 
35. 


36. 


37. 
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Function subtraction — on F is associative. 
Function multiplication - on F is commutative. 
Function multiplication - on F is associative. 
Function composition o on F is commutative. 


If * and *’ are any two binary operations on a set S, then 
ax(b* c) =(a*b)x' (a*c) forall a,b,ceS. 


Suppose that * is an associative binary operation on a set S. Let H = {a € S|axx =x «a for all x € S}. 
Show that H is closed under +. (We think of H as consisting of all elements of S that commute with every 
element in S.) 

Suppose that + is an associative and cormmutative binary operation on a set S.Show that H = {a ¢ Sjaxa=a} 
is closed under *. (The elements of H are idempotents of the binary operation *.) 


IsomorpPHic BINARY STRUCTURES 


Compare Table 3.1 for the binary operation * on the set S = {a, b, c} with Table 3.2 for 
the binary operation *’ on the set T = {#, 5, &}. 

Notice that if, in Table 3.1, we replace all occurrences of a by #, every b by $, and 
every c by & using the one-to-one correspondence 


ao# bef cok 


we obtain precisely Table 3.2. The two tables differ only in the symbols (or names) 
denoting the elements and the symbols » and +’ for the operations. If we rewrite Table 3.3 
with elements in the order y, x, z, we obtain Table 3.4. (Here we did not set up any one- 
one-correpondence; we just listed the same elements in different order outside the heavy 
bars of the table.) Replacing, in Table 3.1, all occurrences of a by y, every b by x, and 
every c by z using the one-to-one correspondence 


a<cy box COZ 


we obtain Table 3.4. We think of Tables 3.1, 3.2, 3.3, and 3.4 as being structurally alike. 
These four tables differ only in the names (or symbols) for their elements and in the 
order that those elements are listed as heads in the tables. However, Table 3.5 for binary 
operation # and Table 3.6 for binary operation % on the set S = {a, b, c} are structurally 
different from each other and from Table 3.1. In Table 3.1, each element appears three 
times in the body of the table, while the body of Table 3.5 contains the single element b. 
In Table 3.6, for all s € S we get the same value c for s # s along the upper-left to lower- 
right diagonal, while we get three different values in Table 3.1. Thus Tables 3.1 through 
3.6 give just three structurally different binary operations on a set of three elements, 
provided we disregard the names of the elements and the order in which they appear as 
heads in the tables. 

The situation we have just discussed is somewhat akin to children in France and in 
Germany learning the operation of addition on the set Z*. The children have different 
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3.1 Table 3.2 Table 3.3 Table 


names (un, deux, trois, - -- versus ein, zwei, drei - - -) for the numbers, but they are learning 
the same binary structure. (In this case, they are also using the same symbols for the 
numbers, so their addition tables would appear the same if they list the numbers in the 
same order.) 

We are interested in studying the different types of structures that binary operations 
can provide on sets having the same number of elements, as typified by Tables 3.4, 3.5, 
and 3.6. Let us consider a binary algebraic structure’ (5, «) to be a set § together with 
a binary operation * on S. In order for two such binary structures (S, *) and (S’, *’) to 
be structurally alike in the sense we have described, we would have to have a one-to-one 
correspondence between the elements x of S and the elements x’ of S’ such that 

if xox’ and yoy’, then x*tyox's'y’, (1) 

A one-to-one correspondence exists if the sets S and S’ have the same number of 
elements. It is customary to describe a one-to-one correspondence by giving a one- 
to-one function @ mapping S onto S' (see Definition 0.12). For such a function ¢, we 
regard the equation ¢(x) = x’ as reading the one-to-one pairing x < x in left-to-right 
order. In terms of @¢, the final < correspondence in (1), which asserts the algebraic 
structure in S’ is the same as in S, can be expressed as 


p(x ey) = o(x) ¥’ Hy). 


Such, a function showing that two algebraic systems are structurally alike is known as 
an isomorphism. We give a formal definition. 


Let (S, *) and (S’, *’) be binary algebraic structures. An isomorphism of S with S’ is a 
one-to-one function ¢ mapping S onto S’ such that 
P(x x y) = b(y) *' P(y) for all x, y € S. 
homomorphism property 


(2) 


* Remember that boldface type indicates that a term is being defined. 
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3.8 Example 


If such a map @ exists, then $ and S’ are isomorphic binary structures, which we 
denote by S ~ S’, omitting the + and *’ from the notation. | 


You may wonder why we labeled the displayed condition in Definition 3.7 the ho- 
momorphism property rather than the isomorphism property. The notion of isomorphism 
includes the idea of one-to-one correspondence, which appeared in the definition via the 
words one-to-one and onto before the display. In Chapter 13, we will discuss the rela- 
tion between S and S’ when ¢ : S — S’ satisfies the displayed homomorphism property, 
but ¢ is not necessarily one to one; ¢ is then called a homomorphism rather than an 
isomorphism. 

It is apparent that in Section 1, we showed that the binary structures (U, -) and 
(R., +¢) are isomorphic for all c € Rt. Also, (U,,-) and (Zn, +n) are isomorphic for 
eachn € Zt, 

Exercise 27 asks us to show that for a collection of binary algebraic structures, the 
relation ~ in Definition 3.7 is an equivalence relation on the collection. Our discussion 
leading to the preceding definition shows that the binary structures defined by Tables 3.1 
through 3.4 are in the same equivalence class, while those given by Tables 3.5 and 3.6 are 
in different equivalence classes. We proceed to discuss how to try to determine whether 
binary structures are isomorphic. 


How to Show That Binary Structures Are Isomorphic 


We now give an outline showing how to proceed from Definition 3.7 to show that two 
binary structures (S, *) and (S’, *’} are isomorphic. 


Step 1 Define the function ¢ that gives the isomorphism of S with S’, Now this 
means that we have to describe, in some fashion, what $(s) is to be for every s ¢ S. 
Step 2 Show that ¢ is a one-to-one function. That is, suppose that (x) = o(y) 
in S’ and deduce from this that x = y in S. 

Step 3 Show that ¢ is onto S’. That is, suppose that s’ € S’ is given and show that 
there does exist s € S such that d(s) = s’. 


Step 4 Show that b(x * y) = @(x) * (y) for all x, y € S. This is just a question 
of computation. Compute both sides of the equation and see whether they are the 
same. 


Let us show that the binary structure (R, +) with operation the usual addition is isomor- 
phic to the structure (R*, -) where - is the usual multiplication. 


Step 1 We have to somchow convert an operation of addition to multiplication. 
Recall from a?+© = (a’)(a‘) that addition of exponents corresponds to 
multiplication of two quantities. Thus we try defining ¢: R > Rt by 
(x) = e* for x € R. Note that e* > 0 forall x € R, so indeed, 

o(x) € Rt. 

Step 2 If @(x) = ¢(), then e* = e”. Taking the natural logarithm, we see that 

x = y, so @ is indeed one to one. 


3.9 Example 


3.10 Example 
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Step 3 Ifr € Rt, then In(r) € R and g(nr) = e™” =r. Thus ¢ is onto Rt. 


Step 4 Forx, y € R, we have (x + y) = e* 1” =e* - e? = G(x) - b(y). Thus we 
see that @ is indeed an isomorphism. A 


as 


Let 2Z = {2n |n € Z}, so that 2Z is the set of all even integers, positive, negative, and 
zero. We claim that (Z, +) is isomorphic to (2Z, +), where + is the usual addition. This 
will give an example of a binary structure (Z, +) that is actually isomorphic to a structure 
consisting of a proper subset under the induced operation, in contrast to Example 3.8, 
where the operations were totally different. 


Step 1 The obvious function ¢ : Z — 2Z to try is given by ¢(n) = 2n forn € Z. 
Step 2 If (mn) = (n), then 2m = 2n som =n. Thus ¢ is one to one. 


Step3 Ifn € 2Z, thenn is even son = 2m form =n/2 € Z. Hence 
o(m) = 2(n/2) =n so @ is onto 2Z. 
Step4 Letm,n € Z. The equation 


o(m +n) = 2(m +n) = 2m +2n = O(m) + O(n) 
then shows that @ is an isomorphism. A 


How to Show That Binary Structures Are Not Isomorphic 


We now turn to the reverse question, namely: 


How do we demonstrate that two binary structures (S, *) and (S’, *') are not 


isomorphic, if this is the case? 


This would mean that there is no one-to-one function @ from S onto S’ with the property 
b(x * y) = d(x) *’ b(y) for all x, y € S. In general, it is clearly not feasible to try every 
possible one-to-one function mapping S onto S’ and test whether it has this property, 
except in the case where there are no such functions. This is the case precisely when S$ 
and S” do not have the same cardinality. (See Definition 0.13.) 


The binary structures (Q, +) and (R, +) are not isomorphic because Q has cardinality Xo 
while {R| 4 Xo. (See the discussion following Example 0.13.) Note that it is not enough 
to say that Q is a proper subset of R. Example 3.9 shows that a proper subset with the 
induced operation can indeed be isomorphic to the entire binary structure. A 


A structural property of a binary structure is one that must be shared by any 
isomorphic structure. It is not concerned with names or some other nonstructural char- 
acteristics of the elements. For example, the binary structures defined by Tables 3.1 and 
3.2 are isomorphic, although the elements are totally different. Also, a structural prop- 
erty is not concerned with what we consider to be the “name” of the binary operation. 
Example 3.8 showed that a binary structure whose operation is our usual addition can be 
isomorphic to one whose operation is our usual multiplication. The number of elements 
in the set S is a structural property of (5, *). 
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3.13 Theorem 


Proof 
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In the event that there are one-to-one mappings of S onto S’, we usually show that 
(S, x) is not isomorphic to (5’, *’) (if this is the case) by showing that one has 
some structural property that the other does not possess. 


The sets Z and Zt both have cardinality Xo, and there are lots of one-to-one functions 
mapping Z onto Z*. However, the binary structures (Z, +) and (Z*, +), where - is the 
usual multiplication, are not isomorphic. In (Z, -) there are two elements x such that 
x +x =x, namely, 0 and |. However, in (Z*, -), there is only the singleelement 1. & 


We list a few examples of possible structural properties and nonstructural properties 
of a binary structure (S, *) to get you thinking along the right line. 


Possible Structural Properties Possible Nonstructural Properties 
1. The set has 4 elements. a. The number 4 is an element. 

2. The operation is commutative. b. The operation is called “addition.” 
3.x%*x =x forallx eS. c. The elements of S$ are matrices. 

4. The equation a «x = bhasa d. S is a subset of C. 


solution x in § for alla,b € S. 


We introduced the algebraic notions of commutativity and associativity in Section 2. 
One other structural notion that will be of interest to us is illustrated by Table 3.3, 
where for the binary operation *” on the set {x, y, z}, we have x *"u=u aX =U 
for all choices possible choices, x, y, and z for u. Thus x plays the same role as 0 in 
(UR, +) where 0+ 4 =u+0=u for all u € R, and the same role as 1 in (R, -) where 
1-u=u-1=u forall u € R. Because Tables 3.1 and 3.2 give structures isomorphic 
to the one in Table 3.3, they must exhibit an element with a similar property. We see that 
bxu=u*b =u forallelements u appearing in Table 3.1 and that $ *°u=ux'$=u 
for all elements u in Table 3.2. We give a formal definition of this structural notion and 
prove a little theorem. 


Let (S,*) be a binary structure. An element e of S is an identity element for « if 
exs=sxe=sforalls eS. a 


(Uniqueness of Identity Element) A binary structure (S, *) has at most one identity 
element. That is, if there is an identity element, it is unique. 


Proceeding in the standard way to show uniqueness, suppose that both e and é are ele- 
ments of S serving as identity elements. We let them compete with each other. Regarding 
é as an identity element, we must have e * @ = @. However, regarding é as an identity 
element, we must have e * @ = e. We thus obtain e = e, showing that an identity element 
must be unique. ¢ 


3.14 Theorem 


Proof 


3.15 Example 


3.16 Example 


3.17 Example 
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If you now have a good grasp of the notion of isomorphic binary structures, it 
should be evident that having an identity element for * is indeed a structural property of 
a structure (S$, «). However, we know from experience that many readers will be unable 
to see the forest because of all the trees that have appeared. For them, we now supply a 
careful proof, skipping along to touch those trees that are involved. 


Suppose (5, «) has an identity element ¢ for *«. If @ : S + S' isan isomorphism of (S, *) 
with ($’, x’), then $(e) is an identity element for the bmary operation *’ on S$”. 


Lets’ € S’. We must show that d(e) ¥ s’ = s’ *’ @(e) = s’. Because ¢ is an isomorphism, 
it is a one-to-one map of S onto S$’. In particular, there exists s € S such that @(s) = s’. 
Now e is an identity element for * so that we know that e x s = s *e = s. Because ¢ is 
a function, we then obtain 


ple x 5) = Hs *e) = G(s). 
Using Definition 3.7 of an isomorphism, we can rewrite this as 
oe) *' b(s) = $(s) * P(e) = A). 


Remembering that we chose s € S such that o(s) = s’, we obtain the desired relation 
d(e)*'s’ = s'* de =s". ¢ 


We conclude with three more examples showing via structural properties that cer- 
tain binary structures are not isomorphic. In the exercises we ask you to show, as in 
Theorem 3.14, that the properties we use to distinguish the structures in these examples 
are indeed structural. That is, they must be shared by any isomorphic structure. 


We show that the binary structures (Q, +) and (Z, +) under the usual addition are not 
isomorphic. (Both Q and Z have cardinality Xo, so there are lots of one-to-one functions 
mapping Q onto Z.) The equation x + x = c has a solution x for all c € Q, but this is 
not the case in Z. For example, the equation x + x = 3 has no solution in Z. We have 
exhibited a structural property that distinguishes these two structures. A 


The binary structures (C, -) and (R, -) under the usual multiplication are not isomorphic. 
(It can be shown that C and R have the same cardinality.) The equation x - x = c has a 
solution x for all c € C, but x - x = —1 has no solution in R. A 


The binary structure (M>(R), -) of2 x 2 real matrices with the usual matrix multiplication 
is notisomorphic to (R, -) with the usual number multiplication. (It can be shown that both 
sets have cardinality |R|.) Multiplication of numbers is commutative, but multiplication 
of matrices is not. A 
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@ EXERCISES 3 


In all the exercises, + is the usual addition on the set where it is specified, and - is the usual multiplication. 


Computations 
1. What three things must we check to determine whether a function @: § > S' is an isomorphism of a binary 
structure (S, *) with (S’, *’)? 


In Exercises 2 through 10, determine whether the given map ¢ is an isomorphism of the first binary structure with 
the second. (See Exercise 1.) If it is not an isomorphism, why not? 


2. (Z, +) with (Z, +) where b(n) = —n forn € Z, 

(Z, +) with (Z, +) where o(n) = 2n forn € Z 

(Z, +) with (Z, +) where d(n) =n +1 forne Zz 

(Q, +) with (Q, +) where (x) = x/2 forx € Q 

(Q, -) with (Q, -) where (x) = x* for x € Q 

. (R, -) with (R, -) where o(x) = x° forx €R 

(M>(R), -) with (R, -) where (A) is the determinant of matrix A 
(M,(R), -) with (R, -) where $(A) is the determinant of matrix A 
10. (R, +) with (Rt, -) where g(r) = 0.5" forr eR 


err an sw 


In Exercises 11 through 15, let F be the set of all functions f mapping R into R that have derivatives of all orders. 
Follow the instructions for Exercises 2 through 10. 

11. (F, +) with (F, +) where $(f) = f', the derivative of f 

12. (F, +) with (R, +) where 6(f) = f'(0) 

13. (F, +) with (F, +) where o(f)(x) = fh f(@dt 

14. (F, +) with (F, +) where (f(x) = 4iff f@dt] 

15. (F,-) with (F, -) where @(f)(x) = x f(x) 


16. The map ¢ : Z— Z defined by o(n) =n +1 forne Z is one to one and onto Z. Give the definition of a 
binary operation * on Z such that ¢ is an isomorphism mapping 


a. (Z, +) onto (Z, *}, b. (Z, *) onto (Z, +). 
In each case, give the identity element for * on Z. 


17. The map ¢ : Z > Z defined by ¢(7) =n +1 forn é€ Z is one to one and onto Z. Give the definition of a 
binary operation « on Z such that ¢ is an isomorphism mapping 


a. (Z, +) onto (Z, +), b. (Z, *) onto (Z, -). 
In each case, give the identity element for « on Z. 


18. The map ¢ : Q > Q defined by o(x) = 3x — 1 forx é Q is one to one and onto Q. Give the definition of a 
binary operation * on Q such that @ is an isomorphism mapping 


a. (Q, +) onto (Q, *), b. (Q, *) onto (Q, +). 


In each case, give the identity element for * on Q. 


Section3 Exercises 35 


19, The map ¢ : Q > Q defined by $(x) = 3x — 1 for x € Q is one to one and onto Q. Give the definition of a 
binary operation * on Q such that ¢ is an isomorphism mapping 


a. (Q, -) onto (Q, *), b. (Q, *) onto (Q,-). 


In each case, give the identity element for * on Q. 


Concepts 


20. The displayed homomorphism condition for an isomorphism ¢ in Definition 3.7 is sometimes summarized 
by saying, “@ must commute with the binary operation(s).” Explain how that condition can be viewed in this 
manner. 


In Exercises 21 and 22, correct the definition of the italicized term without reference to the text, if correction is 
needed, so that it is in a form acceptable for publication. 


21. A function ¢ : § — S’ is an isomorphism if and only if d(a « b) = b(a) x’ o(b). 


22. Let * be a binary operation on a set S. An element e of S with the property s*e =s =exs is an identity 
element for * for alls € S. 


Proof Synopsis 


A good test of your understanding of a proof is your ability to give a one or two sentence synopsis of it, explaining 
the idea of the proof without all the details and computations. Note that we said “sentence” and not “equation.” 
From now on, some of our exercise sets may contain one or two problems asking for a synopsis of a proof in the 
text. It should rarely exceed three sentences. We should illustrate for you what we mean by a synopsis. Here is our 
one-sentence synopsis of Theorem 3.14. Read the statement of the theorem now, and then our synopsis. 


Representing an element of S’ as d(s) for some s € S, use the homomorphism property 
of ¢ to carry the computation of p(e) *’ @(s) back to a computation in S. 
That is the kind of explanation that one mathematician might give another if asked, “How does the proof go?” 


We did not make the computation or explain why we could represent an element of S’ as @(s). To supply every 
detail would result in a completely written proof. We just gave the guts of the argument in our synopsis. 


23. Give a proof synopsis of Theorem 3.13. 


Theory 


24. An identity element for a binary operation * as described by Definition 3.12 is sometimes referred to as “a 
two-sided identity element.” Using complete sentences, give analogous definitions for 


a. a left identity element e; for *, and b. a right identity element ep for x. 


Theorem 3.13 shows that if a two-sided identity element for « exists, it is unique. Is the same true for a one-sided 
identity element you just defined? If so, prove it. If not, give a counterexample (S, *) for a finite set S and find 
the first place where the proof of Theorem 3.13 breaks down. 


25. Continuing the ideas of Exercise 24 can a binary structure have a left identity element e; and a right identity 
element eg where e; # eg? If so, give an example, using an operation on a finite set 5. If not, prove that it is 
impossible. 
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26. 


27. 


28. 
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Recall thatif f : A > Bisa one-to-one function mapping Aonto B, then f~!(b)is the unique a € A such that 
f(a) = b. Prove that if 6: S > S’ is an isomorphism of (S, *) with (S’, *’), then @7' is an isomorphism of 
(S’, *') with (S, *). 

Prove that if @ : S > S’ is an isomorphism of (S, +) with (S’, ¥’) andy: S’ > S” is anisomorphism of (S’, *’) 
with (5”, *”), then the composite function y o ¢ is an isomorphism of (S, *) with (S”, *”). 

Prove that the relation ~ of being isomorphic, described in Definition 3.7, is an equivalence relation on any set 
of binary structures. You may simply quote the results you were asked to prove in the preceding two exercises at 
appropriate places in your proof. 


In Exercises 29 through 32, give a careful proof for a skeptic that the indicated property of a binary structure (S, *) 
is indeed a structural property. (In Theorem 3.14, we did this for the property, “There is an identity element for *,.”) 


29. 
30. 
31. 
32. 
33. 


34. 


The operation * is commutative. 
The operation « is associative. 


For each ¢ € S, the equation x * x = c has a solution x in S. 


There exists an clement b in S such that b * b = b. 
; | for a,b € R. Exercise 23 of 
Section 2 shows that H is closed under both matrix addition and matrix multiplication. 


Let H be the subset of M2(R) consisting of all matrices of the form [ 


a. Show that (C, +) is isomorphic to (H, +). 
b. Show that (C, -) is isomorphic to (H, -). 


(We say that H is a matrix representation of the complex numbers C.) 


There are 16 possible binary structures on the set {a, b} of two elements. How many nonisomorphic (that is, 
structurally different) structures are there among these 16? Phrased more precisely in terms of the isomorphism 
equivalence relation ~ on this set of 16 structures, how many equivalence classes are there? Write down one 
structure from each equivalence class. [Hint: Interchanging a and b everywhere in a table and then rewriting 
the table with elements listed in the original order does not always yield a table different from the one we 
started with. ] 


GROUPS 


Let us continue the analysis of our past experience with algebra. Once we had mastered 
the computational problems of addition and multiplication of numbers, we were ready 
to apply these binary operations to the solution of problems. Often problems lead to 
equations involving some unknown number x, which is to be determined. The simplest 
equations are the linear ones of the forms a + x = b for the operation of addition, and 
ax = b for multiplication. The additive linear equation always has a numerical solution, 
and so has the multiplicative one, provided a # 0. Indeed, the need for solutions of 
additive linear equations such as 5 + x = 2 is a very good motivation for the negative 
numbers. Similarly, the need for rational numbers is shown by equations such as 2x = 3. 

It is desirable for us to be able to solve linear equations involving our binary opera- 
tions. This is not possible for every binary operation, however. For example, the equation 
a*x =a has no solution in S = {a, b, c} for the operation * of Example 2.14. Let us 
abstract from familiar algebra those properties of addition that enable us to solve the 
equation 5 + x =2 in Z. We must not refer to subtraction, for we are concerned with the 
solution phrased in terms of a single binary operation, in this case addition. The steps in 
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the solution are as follows: 


SxS 2. given, 
—5+(5+x)=-54+2, adding —5, 
(-5+5)+x =-5+2, associative law, 

O+x =-5+2, computing —5+5, 

x =—5+42, property of 0, 
x= 3, computing — 5 + 2. 


Strictly speaking, we have not shown here that —3 is a solution, but rather that itis the only 
possibility for a solution. To show that —3 is a solution, one merely computes 5 + (—3). 
A similar analysis could be made for the equation 2x = 3 in the rational numbers with 
the operation of multiplication: 


2x = 3, given, 
5(2x) = $(3), multiplying by 5, ‘< 
G x = 33, associative law, 
1-x = 43, computing 52, 
x= 53, property of 1, 
a computing $3. 


We can now see what properties a set § and a binary operation * on S would have to 
have to permit imitation of this procedure for an equation a * x = b for a, b € S. Basic 
to the procedure is the existence of an element e in S with the property that e + x = x 
for all x € S. For our additive example, 0 played the role of e, and 1 played the role for 
our multiplicative example. Then we need an element a’ in S§ that has the property that 
a’ *a = e. For our additive example with a = 5, —5 played the role of a’, and 5 played 
the role for our multiplicative example with a = 2. Finally we need the associative law. 
The remainder is just computation. A similar analysis shows that in order to solve the 
equation x * a = b (remember that a « x need not equal x * a), we would like to have an 
element e in S such that x * e = x for allx ¢ S$ and ana’ in S such thata x a’ = e. With 
all of these properties of x on S, we could be sure of being able to solve linear equations. 
Thus we need an associative binary structure (S, «) with an identity element e such that 
for each a € S, there exists a’ € S such that a * a’ = a’ x a = ¢. This is precisely the 
notion of a group, which we now define. 


Definition and Examples 


Rather than describe a group using terms defined in Sections 2 and 3 as we did at the end 
of the preceding paragraph, we give a self-contained definition. This enables a person 
who picks up this text to discover what a group is without having to look up more terms. 


A group (G, x) is a set G, closed under a binary operation +, such that the following 
axioms are satisfied: 
%,: For all a, b, c € G, we have 


(ax b)*c =ax*(bxc). associativity of x 


38 


PartI Groups and Subgroups 
GY,; There is an element e in G such that for all x € G, 
exx=xx*xe=x. identity element e for « 
Y,; Corresponding to each a € G, there is an element a’ in G such that 
axa’ =a'*xa=e. inversed ofa | 
4.2 Example We easily see that (U, +) and (U,, -) are groups. Multiplication of complex numbers is 


associative and both U and U, contain 1, which is an identity for multiplication. For 
e® € U, the computation 


. igaall aren 
ei? . ei 2x 6) et =] 


shows that every element of U has an inverse. For z € U,,, the computation 


shows that every element of U,, has an inverse. Thus {U,-) and (U,,-) are groups. 
Because (R,., +.) is isomorphic to (U, -), we see that (R,, +c) is a group forallc € Rt. 
Similarly, the fact that (Z,, +n) is isomorphic to (U,, -) shows that (Z,, +n} 1s a group 
foralln € Z*. A 


We point out now that we will sometimes be sloppy in notation. Rather than use 
the binary structure notation (G, *) constantly, we often refer to a group G, with the 
understanding that there is of course a binary operation on the set G. In the event that 
clarity demands that we specify an operation * on G, we use the phrase “the group G 


#@ Historicat NOTE 


here are three historical roots of the develop- 

ment of abstract group theory evident in the 
mathematical literature of the nineteenth century: 
the theory of algebraic equations, number theory, 
and geometry. All three of these areas used group- 
theoretic methods of reasoning, although the meth- 
ods were considerably more explicit in the first area 
than in the other two. 

One of the central themes of geometry in the 
nineteenth century was the search for invariants 
under various types of geometric transformations. 
Gradually attention became focused on the trans- 
formations themselves, which in many cases can be 
thought of as elements of groups. 

In number theory, already in the eighteenth cen- 
tury Leonhard Euler had considered the remainders 
on division of powers a” by a fixed prime p. These 
remainders have “group” properties. Similarly, 


Carl KF Gauss, in his Disguisitiones Arithmeti- 
cae (1800), dealt extensively with quadratic forms 
ax? +2bxy + cy’, and in particular showed that 
equivalence classes of these forms under compo- 
sition possessed what amounted to group proper- 
ties. 

Finally, the theory of algebraic equations pro- 
vided the most explicit prefiguring of the group con- 
cept. Joseph-Louis Lagrange (1736-1813) in fact 
initiated the study of permutations of the roots of an 
equation as a tool for solving it. These permutations, 
of course, were ultimately considered as elements 
of a group. 

It was Walter von Dyck (1856-1934) and 
Heinrich Weber (1842-1913) who in 1882 were 
able independently to combine the three historical 
roots and give clear definitions of the notion of an 
abstract group. 


under «.” For example, we may refer to the groups Z, Q, and R under addition rather 
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than write the more tedious (Z, +), (Q, +), and (R, +). However, we feel free to refer 
to the group Zg without specifying the operation. ~ 


4.3 Definition 


A group G is abelian if its binary operation is commutative. 


@ HIstToricaL NOTE 


CE groups are called abelian in honor 
of the Norwegian mathematician Niels Henrik 
Abel (1802-1829). Abel was interested in the ques- 
tion of solvability of polynomial equations. In a pa- 
per written in 1828, he proved that if all the roots 
of such an equation can be expressed as rational 
functions f,g,...,# of one of them, say x, and 
if for any two of these roots, f(x) and g(x), the 
relation f(g(x)) = g(f(x)) always holds, then the 
equation is solvable by radicals. Abel showed that 
each of these functions in fact permutes the roots of 
the equation; hence, these functions are elements 
of the group of permutations of the roots. It was 
this property of commutativity in these permuta- 
tion groups associated with solvable equations that 
led Camille Jordan in his 1870 treatise on alge- 
bra to name such groups abelian; the name since 


then has been applied to commutative groups in 
general. 

Abel was attracted to mathematics as a teenager 
and soon surpassed all his teachers in Norway. He 
finally received a government travel grant to study 
elsewhere in 1825 and proceeded to Berlin, where 
he befriended August Crelle, the founder of the most 
influential German mathematical journal. Abel con- 
tributed numerous papers to Crelle’s Journal during 
the next several years, including many in the field 
of elliptic functions, whose theory he created vir- 
tuatly single-handedly. Abel returned to Norway in 
1827 with no position and an abundance of debts. 
He nevertheless continued to write brilliant papers, 
but died of tuberculosis at the age of 26, two days 
before Crelle succeeded in finding a university po- 
sition for him in Berlin. 


4.4 Example 


4.5 Example 


4.6 Example 


4,7 Example 


4.8 Example 


Let us give some examples of some sets with binary operations that give groups and 
also of some that do not give groups. 


The set Z* under addition is not a group. There is no identity element for-+inZ*. A 


The set of all nonnegative integers (including 0) under addition is still not a group. There 
is an identity element 0, but no inverse for 2. A 


The familiar additive properties of integers and of rational, real, and complex numbers 
show that Z, Q, IR, and C under addition are abelian groups. A 


The set Z* under multiplication is not a group. There is an identity 1, but no inverse 
of 3. A 


The familiar multiplicative properties of rational, real, and complex numbers show that 
the sets Q* and Rt of positive numbers and the sets Q*, R*, and C* of nonzero numbers 
under multiplication are abelian groups. A 
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4.9 Example 


4.10 Example 


4.11 Example 


4.12 Example 


4.13 Example 


Solution 


4.14 Example 
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The set of all real-valued functions with domain IR under function addition is a group. 
This group is abelian. A 


(Linear Algebra) Those who have studied vector spaces should note that the axioms 
for a vector space V pertaining just to vector addition can be summarized by asserting 
that V under vector addition is an abelian group. A 


The set Mnyn(R) of all m x n matrices under matrix addition is a group. The m x n 
matrix with all entries 0 is the identity matrix. This group is abelian. A 


The set M,,(R) of all n x n matrices under matrix multiplication is not a group. The 
n x n matrix with all entries 0 has no inverse. A 


Show that the subset S of M,,(R) consisting of all invertible n x n matrices under matrix 
multiplication is a group. 


We start by showing that S is closed under matrix multiplication. Let A and B be in S, 
so that both A~! and B~! exist and AA7! = BB" = I,. Then 


(AB)(B-'A7!) = A(BB“1)A7! = ALA! = In, 


so that AB is invertible and consequently is also in S. 

Since matrix multiplication is associative and J,, acts as the identity element, and 
since each element of S has an inverse by definition of S, we see that S is indeed a group. 
This group is nof commutative. It is our first example of a nonabelian group. A 


The group of invertible n x n matrices described in the preceding example is of 
fundamental importance in linear algebra. It is the general linear group of degree n, 
and is usually denoted by GL(n, R). Those of you who have studied linear algebra know 
that a matrix A in GL(n, R) gives rise to an invertible linear transformation T : R" —> 
IR", defined by T(x) = Ax, and that conversely, every invertible linear transformation 
of R® into itself is defined in this fashion by some matrix in GL(n, IR). Also, matrix 
multiplication corresponds to composition of linear transformations. Thus all invertible 
linear transformations of IR” into itself form a group under function composition; this 
group is usually denoted by GL(R"). Of course, GL(n, R) x GL(R"). 


Let « be defined on Qt by a * b = ab/2. Then 


b b 
ies eS 
2 4 
and likewise 
pepeeege Se. 
2 4 


Thus * is associative. Computation shows that 
2*a=ax2=a 
for all a € Qt, so 2 is an identity element for +. Finally, 
4 
a*x*—-=-—*a=2, 
aoa 
so a’ = 4/a is an inverse for a. Hence Qt with the operation * is a group. A 


4.15 Theorem 


Proof 


4.16 Theorem 


Proof 
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Elementary Properties of Groups ~ 


As we proceed to prove our first theorem about groups, we must use Definition 4.1, which 
is the only thing we know about groups at the moment. The proof of a second theorem 
can employ both Definition 4.1 and the first theorem; the proof of a third theorem can 
use the definition and the first two theorems, and so on. 

Our first theorem will establish cancellation laws. In real arithmetic, we know that 
2a = 2b implies that a = b. We need only divide both sides of the equation 2a = 2b 
by 2, or equivalently, multiply both sides by ;, which is the multiplicative inverse of 2. 
We parrot this proof to establish cancellation laws for any group. Note that we will also 
use the associative law. 


If G is a group with binary operation x, then the left and right cancellation laws 
hold in G, that is, a* b =ax*c implies b =c, and bx a =c *a implies b = c for all 
a,b,céG. 


Suppose a * b = a xc. Then by &, there exists a’, and 

a’x(a*b) =a’ *(axc). 
By the associative law, 

(a’ xa)*xb = (a *a) xc. 
By the definition of a’ in %, a' *a =e, so 

exb=exc. 
By the definition of e in %, 
b=c. 


Similarly, from b x a = c x a one can deduce that b = c upon multiplication on the right 
by a’ and use of the axioms for a group. Ad 


Our next proof can make use of Theorem 4.15. We show that a “linear equation” in 
a group has a unique solution. Recall that we chose our group properties to allow us to 
find solutions of such equations. 


If G is a group with binary operation *, and if a and b are any elements of G, then the 
linear equations a * x = b and y xa = b have unique solutions x and y in G. 


First we show the existence of at least one solution by just computing that a’ * b is a 
solution of a * x = b. Note that 


a*(a’*xb)=(axa')*b, associative law, 
=exh, definition of a’, 
= b, property of e. 


Thus x = a’ x bis a solution of a * x = b. Ina similar fashion, y = b * a’ is a solution 
of yxa=b. 


42 


Part I 


4.17 Theorem 


Proof 


4.18 Corollary 
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To show uniqueness of y, we use the standard method of assuming that we have 
two solutions, y, and y, so that y, *a = 6 and y. *a = b. Then y; a = yz ¥ a, and 
by Theorem 4.15, y; = y2. The uniqueness of x follows similarly. . 


Of course, to prove the uniqueness in the last theorem, we could have followed the 
procedure we used in motivating the definition of a group, showing that ifa «x = b, 
then x — a’ « b. However, we chose to illustrate the standard way to prove an object is 
unique; namely, suppose you have two such objects, and then prove they must be the 
same. Note that the solutions x = a’ « b and y = b x a’ need not be the same unless * is 
commutative. 

Because a group is a special type of binary structure, we know from Theorem 3.13 
that the identity ¢ in a group is unique. We state this again as part of the next theorem 
for easy reference. 


In a group G with binary operation *, there is only one element e in G such that 
exxSxxe=X 
for all x © G. Likewise for each a € G, there is only one element a’ in G such that 
a *xa=a*a =e. 
In summary, the identity element and inverse of each element are unique in a group. 


Theorem 3.13 shows that an identity element for any binary structure is unique. No use 
of the group axioms was required to show this. 

Turning to the uniqueness of an inverse, suppose that a € G has inverses a’ and a” 
so that a’ ka =axa' =e anda” xa =axa" =e. Then 


axa’ =axa =e 


and, by Theorem 4.15, 


so the inverse of a in a group is unique. ¢ 
Note that in a group G, we have 
(axb)x(b' xa )=ax(beb) «a =(axelxd =axd' =e. 


This equation and Theorem 4.17 show that b’ x a’ is the unique inverse of a * b. 
That is, (a * bY = b’ « a’. We state this as a corollary. 


Let G be a group. For all a, b € G, we have (a * bY =D’ xa’. 


For your information, we remark that binary algebraic structures with weaker axioms 
than those for a group have also been studied quite extensively. Of these weaker structures, 
the semigroup, a set with an associative binary operation, has perhaps had the most 
attention. A monoid is a semigroup that has an identity element for the binary operation. 
Note that every group is both a semigroup and a monoid. 
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Finally, it is possible to give axioms for a group (G, *) that seem at first glance to 
be weaker, namely: 


1. The binary operation « on G is associative. 
2. There exists a left identity element ¢ in G such that e « x = x forallx € G. 


3. For eacha é€ G, there exists a left inverse a’ in G such that a’ «a = e. 


From this one-sided definition, one can prove that the left identity element is also a right 
identity element, and a left inverse is also a right inverse for the same element. Thus 
these axioms should not be called weaker, since they result in exactly the same structures 
being called groups. It is conceivable that it might be easier in some cases to check these 
left axioms than to check our two-sided axioms. Of course, by symmetry it is clear that 
there are also right axioms for a group. 


Finite Groups and Group Tables 


All our examples after Example 4.2 have been of infinite groups, that is, groups where 
the set G has an infinite number of elements. We turn to finite groups, starting with the 
smallest finite sets. 

Since a group has to have at least one element, namely, the identity, a minimal set that 
might give rise to a group is a one-element set {e}. The only possible binary operation * 
on {e} is defined by e x e =e. The three group axioms hold. The identity element is 
always its own inverse in every group. 

Let us try to put a group structure on a set of two elements. Since one of the elements 
must play the role of identity element, we may as well let the set be {e, a}. Let us attempt 
to find a table for a binary operation « on {e, a} that gives a group structure on {e, a}. 
When giving a table for a group operation, we shall always list the identity first, as in 
the following table. 


Since e is to be the identity, so 
exxX=xX*E=X 


for all x € {e, a}, we are forced to fill in the table as follows, if x is to give a group: 


a 


Also, a must have an inverse a’ such that 


axa =a *a=e. 
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In our case, a’ must be either e or a. Since a’ = e obviously does not work, we must 
have a’ = a, so we have to complete the table as follows: 


All the group axioms are now satisfied, except possibly the associative property. Check- 
ing associativity on a case-by-case basis from a table defining an operation can be a 
very tedious process. However, we know that Z = {0, 1} under addition modulo 2 is 
a group, and by our arguments, its table must be the one above with e replaced by 0 
and a by 1. Thus the associative property must be satisfied for our table containing e 
and a. 

With this example as background, we should be able to list some necessary conditions 
that a table giving a binary operation on a finite set must satisfy for the operation to give 
a group structure on the set. There must be one element of the set, which we may as well 
denote by e, that acts as the identity element. The condition e * x = x means that the row 
of the table opposite e at the extreme left must contain exactly the elements appearing 
across the very top of the table in the same order. Similarly, the condition x *¢ = x 
means that the column of the table under e at the very top must contain exactly the 
elements appearing at the extreme left in the same order. The fact that every element a 
has a right and a left inverse means that in the row having a at the extreme left, the 
element e must appear, and in the column under a at the very top, the e must appear. 
Thus e must appear in each row and in each column. We can do even better than this, 
however. By Theorem 4.16, not only the equations a * x = e and y * a = e have unique 
solutions, but also the equations a * x = b and y *a = b. By a similar argument, this 
means that each element b of the group must appear once and only once in each row and 
each column of the table. 

Suppose conversely that a table for a binary operation on a finite set is such that 
there is an element acting as identity and that in each row and each column, each element 
of the set appears exactly once. Then it can be seen that the structure is a group structure 
if and only if the associative law holds. If a binary operation * is given by a table, 
the associative law is usually messy to check. If the operation * is defined by some 
characterizing property of a * b, the associative law is often easy to check. Forenaely, 
this second case turns out to be the one usually encountered. 

We saw that there was essentially only one group of two elements in the sense that 
if the elements are denoted by e and a with the identity clement e appearing first, the 
table must be shown in Table 4.19. Suppose that a set has three elements. As before, we 
may as well let the set be {e, a, b}. For e to be an identity element, a binary operation 
* on this set has to have a table of the form shown in Table 4.20. This leaves four 
places to be filled in. You can quickly see that Table 4.20 must be completed as shown 
in Table 4.21 if each row and cach column are to contain each element exactly once. 
Because there was only one way to complete the table and Z3 = {0, 1, 2} under addition 
modulo 3 is a group, the associative property must hold for our table containing e, a, 
and b. 
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Now suppose that G’ is any other group of three elements and imagine a table for G’ 
with identity element appearing first. Since our filling out of the table for G = {e, a, b} 
could be done in only one way, we see that if we take the table for G’ and rename the 
identity e, the next element listed a, and the last element b, the resulting table for G’ 
must be the same as the one we had for G. As explained in Section 3, this renaming 
gives an isomorphism of the group G’ with the group G. Definition 3.7 defined the 
notion of isomorphism and of isomorphic binary structures. Groups are just certain 
types of binary structures, so the same definition pertains to them. Thus our work above 
can be summarized by saying that all groups with a single element are isomorphic, all 
groups with just two elements are isomorphic, and all groups with just three elements are 
isomorphic. We use the phrase up to isomorphism to express this identification using the 
equivalence relation ~. Thus we may say, “There is only one group of three elements, 
up to isomorphism.” 


4.19 Table 4.20 Table 4.21 Table 


EXERCISES 4 


Computations 


In Exercises 1 through 6, determine whether the binary operation « gives a group structure on the given set. If no 
group results, give the first axiom in the order &, ‘4, G from Definition 4.1 that does not hold. 


SPAIN mH B&H NY KE 


10. 


. Let * be defined on Z by letting a « b = ab. 


Let * be defined on 2Z = {2n|n € Z} by lettingaxb=a+b. 


. Let * be defined on R* by letting a * b = ab. 

. Let * be defined on Q by letting a * b = ab. 

. Let * be defined on the set IR* of nonzero real numbers by letting a * b = a/b. 
. Let « be defined on C by letting a « b = jab}. 

. Give an example of an abelian group G where G has exactly 1000 elements. 


. We can also consider multiplication -, modulo n in Z,. For example, 5 -7 6 = 2 in Z because 5-6 = 30 = 


4(7) + 2. The set {1, 3, 5, 7} with multiplication -g modulo 8 is a group. Give the table for this group. 


. Show that the group (U, -) is not isomorphic to either (R, +) or (R*, -). (All three groups have cardinality |R|.) 


Let n be a positive integer and let nZ = {nm|m € Z}. 


a. Show that (nZ, +) is a group. 
b. Show that (nZ, +) ~ (Z, +). 
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In Exercises 11 through 18, determine whether the given set of matrices under the specified operation, matrix 
addition or multiplication, is a group. Recall that a diagonal matrix is a square matrix whose only nonzero entries 
lie on the main diagonal, from the upper left to the lower right corner. An upper-triangular matrix is a square 
matrix with only zero entries below the main diagonal. Associated with each n x n matrix A is a number called 
the determinant of A, denoted by det(A). If A and B are bothn x n matrices, then det(AB) = det(A) det(B). Also, 
det(Z,) = 1 and A is invertible if and only if det(A) 4 0. 


11. 
12. 
13. 
14. 
15. 
16. 
17. 
18. 
19. 


20. 


21. 


Alln x n diagonal matrices under matrix addition. 

All n x n diagonal matrices under matrix multiplication. 

Alln x n diagonal matrices with no zero diagonal entry under matrix multiplication. 
Alln x n diagonal matrices with all diagonal entries 1 or —1 under matrix multiplication. 
Alln x n upper-triangular matrices under matrix multiplication. 

All n x n upper-triangular matrices under matrix addition. 

Alln x n upper-triangular matrices with determinant 1 under matrix multiplication. 

All n x n matrices with determinant either 1 or —1 under matrix multiplication. 


Let S be the set of all real numbers except —1. Define * on S by 


axb=a+b-+ab. 


a. Show that « gives a binary operation on S. 
b. Show that (5, *) is a group. 
ce. Find the solution of the equation 2 * x *3 = 7 in S. 


This exercise shows that there are two nonisomorphic group structures on a set of 4 elements. 

Let the set be {e, a, b, c}, with e the identity element for the group operation. A group table would then have 
to start in the manner shown in Table 4.22. The square indicated by the question mark cannot be filled in with 
a. It must be filled in either with the identity element e or with an element different from both e and a. In this 
latter case, it is no loss of generality to assume that this element is b. If this square is filled in with e, the table 
can then be completed in two ways to give a group. Find these two tables. (You need not check the associative 
law.) If this square is filled in with b, then the table can only be completed in one way to give a group. Find this 
table. (Again, you need not check the associative law.) Of the three tables you now have, two give isomorphic 
groups. Determine which two tables these are, and give the one-to-one onto renaming function which is an 
isomorphism. 


a. Are all groups of 4 elements commutative? 

b. Which table gives a group isomorphic to the group U4, so that we know the binary operation defined by the 
table is associative? 

c. Show that the group given by one of the other tables is structurally the same as the group in Exercise 14 for 
one particular value of n, so that we know that the operation defined by that table is associative also. 


According to Exercise 12 of Section 2, there are 16 possible binary operations on a set of 2 elements. How 
many of these give a structure of a group? How many of the 19,683 possible binary operations on a set of 
3 elements give a group structure? 


Concepts 


22. 


Consider our axioms &, G, and Y, for a group. We gave them in the order 4 %%. Conceivable other 
orders to state the axioms are FAY, GG, GAG, GFA, wd FFF. Of these six possible 
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orders, exactly three are acceptable for a definition. Which orders are not acceptable, and why? (Remember 
this. Most instructors ask the student to define a group on at least one test.) 


4.22 Table 


23. The following “definitions” of a group are taken verbatim, including spelling and punctuation, from papers of 
students who wrote a bit too quickly and carelessly. Criticize them. 


a. A group G is a set of elements together with a binary operation * such that the following conditions are 


Cc 


satisfied 
* iS associative 
There exists e € G such that 


e*xx =x ke =x = identity. 


For every a € G there exists an a’ (inverse) such that 


. A group is a set G such that 


The operation on G is associative. 

there is an identity element (e) in G. 

for every a € G, there is an a’ (inverse for each element) 
A group is a set with a binary operation such 

the binary operation is defined 

an inverse exists 

an identity element exists 


. A set G is called a group over the binery operation « such that for alla, b € G 


Binary operation * is associative under addition 
there exist an element {e} such that 


axe=exaz=e 
Fore every element a there exists an element a’ such that 


axa =a *xa=e 


24. Give a table for a binary operation on the set {e, a, b} of three elements satisfying axioms Y% and & for a 
group but not axiom &. 


25. Mark each of the following true or false. 


a. A group may have more than one identity element. 
b. Any two groups of three elements are isomorphic. 
c. In a group, each linear equation has a solution. 
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d. The proper attitude toward a definition is to memorize it so that you can reproduce it word for word 
as in the text. 

e. Any definition a person gives for a group is correct provided that everything that is a group by that 
person’s definition is also a group by the definition in the text. 

f. Any definition a person gives for a group is correct provided he or she can show that everything 
that satisfies the definition satisfies the one in the text and conversely. 


g. Every finite group of at most three elements is abelian. 

h. An equation of the form a * x * b = always has a unique solution in a group. 
i. The empty set can be considered a group. 

j. Every group is a binary algebraic structure. 


Proof synopsis 


We give an example of a proof synopsis. Here is a one-sentence synopsis of the proof that the inverse of an element 
a ina group (G, *) is unique. 


Assuming that a « a’ = e anda * a" =e, apply the left cancellation Jaw to the equation a * a =axa", 


Note that we said “the left cancellation law” and not “Theorem 4.15.” We always suppose that our synopsis was 
given as an explanation given during a conversation at lunch, with no reference to text numbering and as little 
notation as is practical. 


26. Give a one-sentence synopsis of the proof of the left cancellation law in Theorem 4.15. 


27, Give at most a two-sentence synopsis of the proof in Theorem 4.16 that an equation ax = b has a unique 
solution in a group. 


Theory 


28, From our intuitive grasp of the notion of isomorphic groups, it should be clear that if 6: G > G’ is a group 
isomorphism, then #(e) is the identity e’ of G’. Recall that Theorem 3.14 gave a proof of this for isomorphic 
binary structures (S, *) and (S’, *’). Of course, this covers the case of groups. 

It should also be intuitively clear that if a and a’ are inverse pairs in G, then ¢(a) and o(a’) are inverse pairs 
in G’, that is, that (ay = (a’). Give a careful proof of this for a skeptic who can’t see the forest for all the 
trees. 

29. Show that if G is a finite group with identity e and with an even number of elements, then there isa#~einG 
such thata *a =e. 


30. Let R* be the set of all real numbers except 0. Define « on R* by letting a « b = |alb. 
a. Show that * gives an associative binary operation on R”*. 
b. Show that there is a left identity for + and a right inverse for each element in R*. 
c. Is R* with this binary operation a group? 
d. Explain the significance of this exercise. 
31. If * is a binary operation on a set S, an element x of S is an idempotent for « if x * x = x. Prove that a group 
has exactly one idempotent element. (You may use any theorems proved so far in the text.) 


32. Show that every group G with identity ¢ and such that x * x =e for all x € G is abelian. [Hint: Consider 
(a * b) * (a * b).] 


33. 


34. 


35. 


36. 
37. 
38. 


39, 


40. 


41. 
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Let G be an abelian group and let c” =c*cx*---%c for n factors c, where c € G and n € Z*. Give a 
mathematical induction proof that (a * by” = (a”) x (b") for alla, b € G. 

Let G be a group with a finite number of elements. Show that for any a € G, there exists an n € Z~ such that 
a” = e. See Exercise 33 for the meaning of a”. [Hint: Consider e, a, a’,a?,...,a™, where m is the number 
of elements in G, and use the cancellation laws.] 

Show that if (a x b)? = a? « b* for a and b ina group G, then a * b = b x a. See Exercise 33 for the meaning 
of a’. 

Let G be a group and let a, b € G. Show that (a « b)’ = a’ « b’ if and only ifa xb = ba. 

Let G be a group and suppose that ad * b* c = e fora, b,c € G. Show that b *c *a = e also. 

Prove that a set G, together with a binary operation * on G satisfying the left axioms 1, 2, and 3 given on 
page 43, is a group. 


Prove that a nonempty set G, together with an associative binary operation * on G such that 
axx = band yx*a = b have solutions in G for alla, b € G, 


is a group. [Hint: Use Exercise 38.] 
Let (G, -) be a group. Consider the binary operation * on the set G defined by 


axkb=b-a 


for a, b € G. Show that (G, *) is a group and that (G, x) is actually isomorphic to (G, -). [Hint: Consider the 
map ¢ with d(a) = a’ fora € G.] 

Let G be a group and let g be one fixed element of G. Show that the map i,, such that i,(x) = gxg’ forx € G, 
is an isomorphism of G with itself. 


SUBGROUPS 


Notation and Terminology 


It is time to explain some conventional notation and terminology used in group theory. 
Algebraists as a rule do not use a special symbol « to denote a binary operation different 
from the usual addition and multiplication. They stick with the conventional additive or 
multiplicative notation and even call the operation addition or multiplication, depending 
on the symbol used. The symbol for addition is, of course, +, and usually multiplication 
is denoted by juxtaposition without a dot, if no confusion results. Thus in place of the 
notation a * b, we shall be using either a + 6 to be read “the sum of a and b,” or ab 
to be read “the product of a and b.” There is a sort of unwritten agreement that the 
symbol + should be used only to designate commutative operations. Algebraists feel 
very uncomfortable when they see a + b # b + a. For this reason, when developing our 
theory in a general situation where the operation may or may not be commutative, we 
shall always use multiplicative notation. 

Algebraists frequently use the symbol 0 to denote an additive identity element and 
the symbol | to denote a multiplicative identity element, even though they may not be 
actually denoting the integers 0 and 1. Of course, if they are also talking about numbers 
at the same time, so that confusion would result, symbols such as e or u ate used as 
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5.2 Table 


5.3 Definition 


5.4 Definition 
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identity elements. Thus a table for a group of three elements might be one like Table 5.1 
or, since such a group is commutative, the table might look like Table 5.2. In general 
situations we shall continue to use e to denote the identity element of a group. 

It is customary to denote the inverse of an element a in a group by a~! in multi- 
plicative notation and by —a in additive notation. From now on, we shall be using these 
notations in place of the symbol a’. 

Let n be a positive integer. If a is an element of a group G, written multiplicatively, 
we denote the product aaa... a for n factors a by a”. We let a® be the identity element 
e, and denote the product a~!a~!a7!...a7! for n factors by a~". It is easy to see that 
our usual law of exponents, aa” = a™™ form,n € Z, holds. Form,n € Z*, itis clear. 
We illustrate another type of case by an example: 


fae = — ah eat oo ime. 
aa? =a ‘a 'aaaaa =a‘(a 'q)aaaa = a7'eaaaa = a ‘(ea)aaa 


=a 'aaaa = (a ta)aaa = edda = (ea)aa = daa = a. 
In additive notation, we denote a+a+a-+---+a for m summands by na, denote 
(—a) + (-a) + (-a) +--+ + (—a) for n summands by —na, and let Oa be the identity 
element. Be careful: In the notation na, the number n is in Z, not in G. One reason 
we prefer to present group theory using multiplicative notation, even if G is abelian, 
is the confusion caused by regarding n as being in G in this notation na. No one ever 
misinterprets the n when it appears in an exponent. 
Let us explain one more term that is used so often it merits a special definition. 


If G is a group, then the order |G| of G is the number of elements in G. (Recall from 
Section 0 that, for any set S, |S| is the cardinality of S.) | 


Subsets and Subgroups 


You may have noticed that we sometimes have had groups contained within larger 
groups. For example, the group Z under addition is contained within the group Q under 
addition, which in turn is contained in the group R under addition. When we view the 
group (Z, +) as contained in the group (IR, +), it is very important to notice that the 
operation + on integers n and m as elements of (Z, +) produces the same elementn + m 
as would result if you were to think of n and m as elements in (R, +-). Thus we should 
not regard the group (Q*, -) as contained in (R, +), even though Q” is contained in R as 
a set. In this instance, 2-3 = 6 in (Qt, -), while 2 +3 = 5 in (R, +). We are requiring 
not only that the set of one group be a subset of the set of the other, but also that the 
group operation on the subset be the induced operation that assigns the same element 
to each ordered pair from this subset as is assigned by the group operation on the whole 
set. 


If a subset H of a group G is closed under the binary operation of G and if H with the 
induced operation from G is itself a group, then H is a subgroup of G. We shall let 
H < GorG > H denote that H is a subgroup of G, and H < G or G > H shall mean 
H<GbutH 4G. a 


5.5 Definition 


5.6 Example 


5.7 Example 


5.8 Example 


5.9 Example 
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Thus (Z, +) < (R, +) but (Q*, -) is not a subgroup of (R, +), even though as sets, 
Q* Cc R. Every group G has as subgroups G itself and {e}, where ¢ is the identity element 
of G. 


If G is a group, then the subgroup consisting of G itself is the improper subgroup of G. 
All other subgroups are proper subgroups. The subgroup {e} is the trivial subgroup 
of G. All other subgroups are nontrivial. a 


We turn to some illustrations. 


Let R” be the additive group of all n-component row vectors with real number entries. 
The subset consisting of all of these vectors having 0 as entry in the first component is 
a subgroup of R”. A 


+ under multiplication is a proper subgroup of R* under multiplication. A 


The nth roots of unity in C form a subgroup U,, of the group C* of nonzero complex 
numbers under multiplication. A 


There are two different types of group structures of order 4 (see Exercise 20 of Section 4). 
We describe them by their group tables (Tables 5.10 and 5.11). The group V is the Klein 
4-group, and the notation V comes from the German word Vier for four. The group 
Z4 is isomorphic to the group Us = {1, i, —1, —i} of fourth roots of unity under multi- 
plication. 

The only nontrivial proper subgroup of Z, is {0,2}. Note that {0, 3} is nota subgroup 
of Z4, since {0, 3} is not closed under +. For example, 3+ 3 = 2, and 2 ¢ {0, 3}. 
However, the group V has three nontrivial proper subgroups, {e, a}, {e, b}, and {e, c}. 
Here {e, a, b} is not a subgroup, since {e, a, b} is not closed under the operation of V 
because ab = c, andc ¢ {e, a, b}. A 


5.10 Table 5.11 Table 


24: 


It is often useful to draw a subgroup diagram of the subgroups of a group. In such 
a diagram, a line running downward from a group G to a group H means that H isa 
subgroup of G. Thus the larger group is placed nearer the top of the diagram. Figure 5.12 
contains the subgroup diagrams for the groups Z4 and V of Example 5.9. 
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5.13 Example 


5.14 Theorem 


Proof 


5.15 Example 


Groups and Subgroups 


Note that if H < G and a € H, then by Theorem 4.16, the equation ax = a must 
have a unique solution, namely the identity element of H. But this equation can also 
be viewed as one in G, and we see that this unique solution must also be the identity 
element e of G. A similar argument then applied to the equation ax = e, viewed in both 
H and G, shows that the inverse a~! of a in G is also the inverse of a in the subgroup H. 


Za V 
{0, 2} {e, a} {e, b} fe, c} 


0} {e} 


(a) (b) 


5.12 Figure (a) Subgroup diagram for Z,. (b) Subgroup diagram for V. 


Let F be the group of all real-valued functions with domain R under addition. The 
subset of F consisting of those functions that are continuous is a subgroup of F’,, for 
the sum of continuous functions is continuous, the function f where f(x) = 0 for all 
x is continuous and is the additive identity element, and if f is continuous, then —/f is 
continuous. A 


It is convenient to have routine steps for determining whether a subset of a group G 
is a subgroup of G. Example 5.13 indicates such a routine, and in the next theorem, we 
demonstrate carefully its validity. While more compact criteria are available, involving 
only one condition, we prefer this more transparent theorem for a first course. 


A subset H of a group G is a subgroup of G if and only if 


1. H is closed under the binary operation of G, 
2. the identity element e of G is in H, 
3. foralla € H itis true that a”! € H also. 


The fact that if H < G then Conditions 1, 2, and 3 must hold follows at once from the 
definition of a subgroup and from the remarks preceding Example 5.13. 

Conversely, suppose H is a subset of a group G such that Conditions 1, 2, and 3 hold. 
By 2 we have at once that & is satisfied. Also % is satisfied by 3. It remains to check 
the associative axiom, Y. But surely for all a, b, c € H itis true that (ab)c = a(be) in 
H, for we may actually view this as an equation in G, where the associative law holds. 
Hence H < G. Sd 


Let F be as in Example 5.13. The subset of F consisting of those functions that are 
differentiable is a subgroup of F, for the sum of differentiable functions is differentiable, 
the constant function 0 is differentiable, and if f is differentiable, then —/f is differen- 
tiable. A 


5.16 Example 


5.17 Theorem 
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Recall from linear algebra that every square matrix A has associated with it a number 
det(A) called its determinant, and that A is invertible if and only if det(A) 4 0. If A and B 
are square matrices of the same size, then it can be shown that det(A B) = det(A) - det(B). 
Let G be the multiplicative group of all invertible n x n matrices with entries in C and 
let T be the subset of G consisting of those matrices with determinant 1. The equation 
det(A B) = det(A) - det(B) shows that T is closed under matrix multiplication. Recall 
that the identity matrix J, has determinant 1. From the equation det(A) - det(A7!) = 
det(AA~!) = det(J,) = 1, we see that if det(A) = 1, then det(A~!) = 1. Theorem 5.14 
then shows that T is a subgroup of G. A 


Cyclic Subgroups 


Let us see how large a subgroup H of Z;. would have to be if it contains 3. It would have 
to contain the identity element 0 and 3 + 3, which is 6. Then it has to contain 6 + 3, 
which is 9. Note that the inverse of 3 is 9 and the inverse of 6 is 6. It is easily checked 
that H = {0, 3, 6, 9} is a subgroup of Zp, and it is the smallest subgroup containing 3. 

Let us imitate this reasoning in a general situation. As we remarked before, for 
a general argument we always use multiplicative notation. Let G be a group and let 
aéG. A subgroup of G containing a must, by Theorem 5.14, contain a”, the result 
of computing products of a and itself for n factors for every positive integer n. These 
positive integral powers of a do give a set closed under multiplication. It is possible, 
however, that the inverse of a is not in this set. Of course, a subgroup containing a must 
also contain a~!, and, in general, it must contain a~” for allm € Zt. It must contain the 
identity element e = a°. Summarizing, a subgroup of G containing the element a must 
contain all elements a" (or na for additive groups) for all n € Z. That is, a subgroup 
containing a must contain {a"\n € Z}. Observe that these powers a” of a need not be 
distinct. For example, in the group V of Example 5.9, 


2 = 
a” = 6, a=a, d=e, at=a, and so on. 


We have almost proved the next theorem. 


Let G be a group and let a € G. Then 
H={a"|neZ} 


is a subgroup of G and is the smallest’ subgroup of G that contains a, that is, every 
subgroup containing a contains H. 


* We may find occasion to distinguish between the terms minimal and smallest as applied to subsets of a set $ 
that have some property. A subset H of S is minimal with respect to the property if H has the property, and 
no subset K C H, K # H, has the property. If H has the property and H C K for every subset K with the 
property, then H is the smallest subset with the property. There may be many minimal subsets, but there can 
be only one smallest subset. To illustrate, {e, a}, {e, b}, and {e, c} are all minimal nontrivial subgroups of the 
group V. (See Fig. 5.12.) However, V contains no smallest nontrivial subgroup. 
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5.18 Definition 


5.19 Definition 


5.20 Example 


5.21 Example 


5.22 Example 


5.23 Example 


Groups and Subgroups 


We check the three conditions given in Theorem 5.14 for a subset of a group to give a 
subgroup. Since a”a* = a’t* forr, s € Z, we see that the product in G of two clements 
of H is again in H. Thus H is closed under the group operation of G. Also a® =e, so 
e € H,and fora’ ¢ H,a~’ € H anda™’a’ =e. Hence all the conditions are satisfied, 
and H <G. 

Our arguments prior to the statement of the theorem showed that any subgroup of 
G containing a must contain H, so H is the smallest subgroup of G containing a. ¢ 


Let G be a group and let a € G. Then the subgroup {a” |n € Z} of G, characterized 
in Theorem 5.17, is called the cyclic subgroup of G generated by a, and denoted 
by (a). | 


An element a of a group G generates G and is a generator for G if (a) = G. A group 
G is eyelic if there is some element a in G that generates G. | 


Let Z4 and V be the groups of Example 5.9. Then Z, is cyclic and both 1 and 3 are 
generators, that is, 


(1) = (3) = Zy. 


However, V is not cyclic, for (a), (b), and (c) are proper subgroups of two elements. Of 
course, (e) is the trivial subgroup of one element. A 


The group Z under addition is a cyclic group. Both 1 and —1 are generators for this 
group, and they are the only generators. Also, for n € Z*, the group Z, under addition 
modulo n is cyclic. If n > 1, then both 1 and n — | are generators, but there may be 
others. A 


Consider the group Z under addition. Let us find (3). Here the notation is additive, and 
(3) must contain 


3, 343=6, 34343=9, and so on, 
0, 3, 34+-3=-6, 34+-34+-3=-9, andsoon. 


In other words, the cyclic subgroup generated by 3 consists of all multiples of 3, positive, 
negative, and zero. We denote this subgroup by 3Z as well as (3). In a similar way, we 
shall let nZ be the cyclic subgroup (n) of Z. Note that 6Z < 3Z. a 


For each positive integer 1, Ict U,, be the multiplicative group of the nth roots of unity 
in C. These elements of U,, can be represented geometrically by equally spaced points 
on a circle about the origin, as illustrated in Fig. 5.24. The heavy point represents the 
number 


2x |, On 
¢ =cos — +7sin—. 
n n 
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The geometric interpretation of multiplication of complex numbers, explained in Sec- 
tion 1, shows at once that as ¢ is raised to powers, it works its way counterclockwise 
around the circle, landing on each of the elements of U,, in turn. Thus U,, under multi- 
plication is a cyclic group, and ¢ is a generator. The group U,, is the cyclic subgroup (¢) 
of the group U of all complex numbers z, where |z| = 1, under multiplication. A 


5.24 Figure 


EXERCISES 5 


Computations 


In Exercises 1 through 6, determine whether the given subset of the complex numbers is a subgroup of the group 
C of complex numbers under addition. 


1.R 2. QF 3. 7Z 
4. The set iR of pure imaginary numbers including 0 
5. The set 2 Q of rational multiples of x 6. The set {7" |n € Z} 


7, Which of the sets in Exercises | through 6 are subgroups of the group C* of nonzero complex numbers under 
multiplication? 


In Exercises 8 through 13, determine whether the given set of invertible n x m matrices with real number entries is 
a subgroup of GL(n, R). 
8. Then x n matrices with determinant 2 
9. The diagonal n x n matrices with no zeros on the diagonal 
10. The upper-triangular n x n matrices with no zeros on the diagonal 
11. Then x n matrices with determinant —1 
12. Then x n matrices with determinant —1 or 1 


13. The set of all n x nm matrices A such that (A’)A = 1,. [These matrices are called orthogonal. Recall that A’, 
the transpose of A, is the matrix whose jth column is the jth row of A for 1 < 7 <n, and that the transpose 
operation has the property (AB)? =(B’)\(A’).] 
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Let F be the set of all real-valued functions with domain R and let F be the subset of F consisting of those functions 
that have a nonzero value at every point in R. In Exercises 14 through 19, determine whether the given subset of F 
with the induced operation is (a) a subgroup of the group F under addition, (b) a subgroup of the group F under 
multiplication. 


14. The subset F 

15. The subset of all f € F such that f(1) = 0 
16. The subset of all f € F such that f(1) =1 
17. The subset of all f € F such that f(0) = 1 
18. The subset of all f € F such that f(0) = — 
19. The subset of all constant functions in F. 


20. Nine groups are given below. Give a complete list of all subgroup relations, of the form G; < G;, that exist 
between these given groups G;, G2,---, Go. 
G, = Z under addition 
G» = 12Z under addition 
G3 = Q' under multiplication 
G4 = R under addition 
Gs = R* under multiplication 
Go = {x"|n € Z} under multiplication 
G7 = 3Z under addition 
Gg = the set of all integral multiples of 6 under addition 
Go = {6" |n € Z} under multiplication 


21. Write at least 5 elements of each of the following cyclic groups. 
a. 25Z under addition 
b. {()" |n © Z} under multiplication 
c. {2"|n € Z} under multiplication 


In Exercises 22 through 25, describe all the elements in the cyclic subgroup of GL(2, R) generated by the given 
2 x 2 matrix. 


0 -l 1 1 3.0 0 -2 
m{ oj] ea 24, | “| 2.| 5 a| 


26. Which of the following groups are cyclic? For each cyclic group, list all the generators of the group. 
G, = (Z,+) Gr=(Q,+) Gs=(Q*,-) Gs = (6Z,+) 
Gs = {6" |n € Z} under multiplication 


= {a+ bV2| a,b € Z} under addition 


In Exercises 27 through 35, find the order of the cyclic subgroup of the given group generated by the indicated 
element. 

27. The subgroup of Z,4 generated by 3 

28. The subgroup of V generated by c oot Table 5. oe 

29. The subgroup of U, generated by cos = +i sin = 

30. The subgroup of Us generated by cos yi sin = 


31. The subgroup of Us generated by cos 22 7 +i sin % 
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32. The subgroup of Us generated by cos a +i sin on 
33. The subgroup of the multiplicative group G of invertible 4 x 4 matrices generated by 
0 0 1 0 
000 1 
10 0 0 
0 1 0 0 
34. The subgroup of the multiplicative group G of invertible 4 x 4 matrices generated by 
000 1 
0 0 1 0 
1 0 0 0 
0 1 0 0 
35. The subgroup of the multiplicative group G of invertible 4 x 4 matrices generated by 
0 1 0 0 
00 0 1 
00 1 0 
100 0 
36. a. Complete Table 5.25 to give the group Z, of 6 elements. 
b. Compute the subgroups (0), (1), (2), (3), (4), and (5) of the group Z¢ given in part (a). 
c. Which elements are generators for the group Ze of part (a)? 
d. 


of Le ) 
5.25 Table 


Concepts 


inverse of each of its elements. 


38. A group G is cyclic if and only if there exists a € G such that G = {a" |n € Z}. 
39. Mark each of the following true or false. 


a. The associative law holds in every group. 
b. There may be a group in which the cancellation law fails. 


37 


. Give the subgroup diagram for the part (b) subgroups of Zs. (We will see later that these are all the subgroups 


In Exercises 37 and 38, correct the definition of the italicized term without reference to the text, if correction is 
needed, so that it is in a form acceptable for publication. 


37. A subgroup of a group G is a subset H of G that contains the identity element e of G and also contains the 
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_______ e. Every group is a subgroup of itself. 


_______ d. Every group has exactly two improper subgroups. 


_______ e. In every cyclic group, every element is a generator. 

f. A cyclic group has a unique generator. 

g. Every set of numbers that is a group under addition is also a group under multiplication. 
h. A subgroup may be defined as a subset of a group. 


i. Zq is acyclic group. 
j. Every subset of every group is a subgroup under the induced operation. 


Show by means of an example that it is possible for the quadratic equation x? =e to have more than two 
solutions in some group G with identity e. 


Theory 


In Exercises 41 and 42, let 6: G > G’ be an isomorphism of a group (G, *) with a group (G’, *'). Write out a 
proof to convince a skeptic of the intuitively clear statement. 


41. 


42. 
43. 


44, 


45. 


46. 
47. 


48. 


49, 
50. 


51. 


53. 


If H is a subgroup of G, then d(H] = {¢(h) | h € H} is a subgroup of G’. That is, an isomorphism carries 
subgroups into subgroups. 


If G is cyclic, then G’ is cyclic. 
Show that if H and K are subgroups of an abelian group G, then 
{hk|hé€ H andk € K} 


is a subgroup of G. 


Find the flaw in the following argument: “Condition 2 of Theorem 5.14 is redundant, since it can be derived 
from 1 and 3, for leta € H. Then a7! € H by 3, and by 1, aa~! = ¢ is an element of H, proving 2.” 


Show that a nonempty subset H of a group G is a subgroup of G if and only if ab"! € H forall a,b € H. 
(This is one of the more compact criteria referred to prior to Theorem 5.14) 


Prove that a cyclic group with only one generator can have at most 2 elements. 


Prove that if G is an abelian group, written multiplicatively, with identity element ¢, then all elements x of G 
satisfying the equation x* = e form a subgroup H of G. 


Repeat Exercise 47 for the general situation of the set H of all solutions x of the equation x” = e for a fixed 
integer n > 1 in an abelian group G with identity e. 


Show that if a € G, where G is a finite group with identity ¢, then there exists n € Z* such that a” = e. 


Let a nonempty finite subset H of a group G be closed under the binary operation of G. Show that H is a 
subgroup of G. 


Let G be a group and let a be one fixed element of G. Show that 
H, = {x € G|xa = ax} 


is a subgroup of G. 


. Generalizing Exercise 51, let S be any subset of a group G. 


a. Show that Hs = {x € G| xs = sx forall s € S} is a subgroup of G. 
b. In reference to part (a), the subgroup Hg is the center of G. Show that Hg is an abelian group. 


Let H be a subgroup of a group G. For a,b € G, let a ~ b if and only if ab~' € H. Show that ~ is an 
equivalence relation on G. 


54. 


55. 
56. 


57. 
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For sets H and K, we define the intersection H M K by 


HOOK ={x|x € Handx e€ K}. 


Show that if H < Gand K < G,then HM K < G. (Remember: < denotes “is a subgroup of,” not “is a subset 


of.”) 


Prove that every cyclic group is abelian. 


Let G be a group and let G, = {g” | g € G}. Under what hypothesis about G can we show that G,, is asubgroup 


of G? 


Show that a group with no proper nontrivial subgroups is cyclic. 


6.1 Theorem 
Proof 


Cyciic Groups 
Recall the following facts and notations from Section 5. If G is a group anda € G, then 
H={a"|neZ} 


is a subgroup of G (Theorem 5.17). This group is the cyclic subgroup (a) of G generated 
by a. Also, given a group G and an element a in G, if 


G = {a"|n €Z}, 


then a is a generator of G and the group G = (a) is cyclic. We introduce one new bit of 
terminology. Let a be an element of a group G. If the cyclic subgroup (a) of G is finite, 
then the order of a is the order |(a)| of this cyclic subgroup. Otherwise, we say that a 
is of infinite order. We will see in this section that if a € G is of finite order m, then m 
is the smallest positive integer such that a” = e. 

The first goal of this section is to describe all cyclic groups and all subgroups of 
cyclic groups. This is not an idle exercise. We will see later that cyclic groups serve 
as building blocks for all sufficiently small abelian groups, in particular, for all finite 
abelian groups. Cyclic groups are fundamental to the understanding of groups. 


Elementary Properties of Cyclic Groups 


We start with a demonstration that cyclic groups are abelian. 


Every cyclic group is abelian. 
Let G be a cyclic group and let a be a generator of G so that 
G = (a) = {a" |n € Z}. 


If g, and go are any two elements of G, there exist integers r and s such that g; =a” 
and g. = a’. Then 


+S St 


gig2g=ada =a a’ =a*da’ = gg, 


so G is abelian. Sd 


We shall continue to use multiplicative notation for our general work on cyclic 
groups, even though they are abelian. 
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The division algorithm that follows is a seemingly trivial, but very fundamental tool 
for the study of cyclic groups. 


r n 
n>0,¢20- re ae 
—m 0) m 2m qm (q+1)m 
r nh 
n<0.¢<0 f. 
qm (q+ 1)m m 0 m 2m 


6.2 Figure 


6.3 Division Algorithm for Z If m is a positive integer and n is any integer, then there exist unique integers q 


Proof 


6.4 Example 


Solution 


6.5 Example 


Solution 


and r such that 
n=mg+r and O<r<m. 


We give an intuitive diagrammatic explanation, using Fig. 6.2. On the real x-axis of 
analytic geometry, mark off the multiples of m and the position of n. Now n falls either 
on a multiple gm of m and r can be taken as 0, or n falls between two multiples of m. 
If the latter is the case, let gm be the first multiple of m to the left of n. Then r is as 
shown in Fig. 6.2. Note that 0 <r < m. Uniqueness of q andr follows since if 7 is not 
a multiple of m so that we can take r = 0, then there is a unique multiple gm of m to the 
left of n and at distance less than m from n, as illustrated in Fig. 6.2. Sd 


In the notation of the division algorithm, we regard q as the quotient and 7 as the 
nonnegative remainder when n is divided by m. 


Find the quotient g and remainder r when 38 is divided by 7 according to the division 
algorithm. 


The positive multiples of 7 are 7, 14, 21, 28, 35, 42,---. Choosing the multiple to leave 
a nonnegative remainder less than 7, we write 


38 = 354+3=7(5)4+3 


so the quotient is g = 5 and the remainder is r = 3. A 


Find the quotient g and remainder r when —38 is divided by 7 according to the division 
algorithm. 


The negative multiples of 7 are —7, —14, —21, —28, —35, —42,---. Choosing the mul- 
tiple to leave a nonnegative remainder less than 7, we write 

—38 = -42+4=7(-6)+4 
so the quotient is ¢ = —6 and the remainder is r = 4. A 


We will use the division algorithm to show that a subgroup H of a cyclic group G 
is also cyclic. Think for a moment what we will have to do to prove this. We will have to 


6.6 Theorem 


Proof 


6.7 Corollary 
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use the definition of a cyclic group since we have proved little about cyclic groups yet. 
That is, we will have to use the fact that G has a generating element a. We must then 
exhibit, in terms of this generator a, some generator c = a™ for H in order to show that 
H is cyclic. There is really only one natural choice for the power m of a to try. Can you 
guess what it is before you read the proof of the theorem? 

A subgroup of a cyclic group is cyclic. 


Let G be acyclic group generated by a and let H be a subgroup of G. If H = {e}, then 
H = (e) is cyclic. If H # {e}, then a” € H for some n € Z*. Let m be the smallest 
integer in Z* such that a” € H. 

We claim that c = a” generates H; that is, 


H =(a™) = (c). 


We must show that every b € H is a power of c. Since b € H and H < G, we have 
b = a" for some n. Find g andr such that 


n=m@qt+r for O<r<m 
in accord with the division algorithm. Then 
a" =a” = (a™)Jta’, 
so 
a’ =(a") 4a". 
Now since a" € H,a™ ¢ H, and H isa group, both (a”)~? and a” are in H. Thus 
(a") 4a" EH; that is, a ed. 


Since m was the smallest positive integer such that a” ¢ H and 0 <r <m, we must 
have r = 0. Thus n = gm and 


b=a" =(a") =c!4, 
so b is a power of c. ¢ 


As noted in Examples 5.21 and 5.22, Z under addition is cyclic and for a positive 
integer n, the set nZ of all multiples of n is a subgroup of Z under addition, the cyclic 
subgroup generated by n. Theorem 6.6 shows that these cyclic subgroups are the only 
subgroups of Z under addition. We state this as a corollary. 


The subgroups of Z under addition are precisely the groups nZ under addition forn € Z. 


This corollary gives us an elegant way to define the greatest common divisor of 
two positive integers r and s. Exercise 45 shows that H = {mr+ms|n,meéZhisa 
subgroup of the group Z under addition. Thus H must be cyclic and have a generator ¢, 
which we may choose to be positive. 
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6.8 Definition 


6.9 Example 


Solution 


Groups and Subgroups 


Let r and s be two positive integers. The positive generator d of the cyclic group 
H ={nr+ms\|n,me Z} 


under addition is the greatest common divisor (abbreviated gcd) of r and s. We write 
d = gcd(r, 5). | 


Note from the definition that d is a divisor of both r and s since both r = 1r + Os 
and s = Or + 1s are in H. Since d € H, we can write 


d=nr+ms 


for some integers n and m. We see that every integer dividing both r and s divides the 
right-hand side of the equation, and hence must be a divisor of d also. Thus d must 
be the largest number dividing both r and s; this accounts for the name given to d in 
Definition 6.8. 


Find the ged of 42 and 72. 


The positive divisors of 42 are 1, 2, 3, 6, 7, 14, 21, and 42. The positive divisors of 72 
are 1, 2, 3, 4, 6, 8, 9, 12, 18, 24, 36, and 72. The greatest common divisor is 6. Note 
that 6 = (3)(72) + (—5)(42). There is an algorithm for expressing the greatest common 
divisor d of r and s in the form d =nr + ms, but we will not need to make use of it 
here. A 


Two positive integers are relatively prime if their gcd is 1. For example, 12 and 25 
are relatively prime. Note that they have no prime factors in common. In our discussion 
of subgroups of cyclic groups, we will need to know the following: 


Ifr ands are relatively prime and ifr divides sm, thenr must divide m. " 


Let’s prove this. If r and s are relatively prime, then we may write 
l=ar+bs for some a,beZ. 
Multiplying by m, we obtain 
m=arm-+ bsm. 


Now r divides both arm and bsm since r divides sm. Thus r is a divisor of the right-hand 
side of this equation, so r must divide m. 


The Structure of Cyclic Groups 


We can now describe all cyclic groups, up to an isomorphism. 


6.10 Theorem 


Proof 


6.13 Example 
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Let G be acyclic group with generator a. If the order of G is infinite, then G is isomorphic 
to (Z, +). If G has finite order n, then G is isomorphic to (Zy, +p). 


CaseI_ For all positive integers m, a” # e. In this case we claim that no two 
distinct exponents h and k can give equal elements a’ and ak of G. 
Suppose that a” = a‘ and say h > k. Then 


contrary to our Case I assumption. Hence every element of G can be 
expressed as a” for a unique m € Z. The map @ : G — Z given by 
¢(a') = i is thus well defined, one to one, and onto Z. Also, 


oa!) = O(a) =i+j =6@')+o@), 
so the homomorphism property is satisfied and ¢@ is an isomorphism. 
Case II a” = e for some positive integer m. Let n be the smallest positive 
integer such that a” =e. Ifs € Zands =nq +r for0 <r <n, then 
a’ = q"4*" — (a")?a’ = ef a’ =a". Asin Case 1,if0 <k <h <nand 
a’ = a*, then a’ = e and 0 < A —k <n, contradicting our choice of 
n. Thus the elements 


a =e,a,a’,a’,---,a"! 


are all distinct and comprise all elements of G. The map y : G > Z, 
given by y(a') =i fori = 0, 1,2,---,— 1 is thus well defined, one to 
one, and onto Z,,. Because a” = e, we see that a'a/ = a* where 

k =i+, j. Thus 


waa!) =i +, j = Wa") 4n Va’), 


so the homomorphism property is satisfied and yw is an isomorphism. 
¢ 


ie) 


n-] 1 


6.11 Figure 6.12 Figure 


Motivated by our work with U,, it is nice to visualize the elements e = a°,a',a?,---, 
a"! of acyclic group of order n as being distributed evenly on a circle (see Fig. 6.11). The 
element a” is located h of these equal units counterclockwise along the circle, measured 
from the bottom where e = a® is located. To multiply a’ and a* diagrammatically, we 
start from a” and go & additional units around counterclockwise. To see arithmetically 
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where we end up, find g andr such that 
h+k=nqt+r for O<r<n. 
The ng takes us all the way around the circle g times, and we then wind upata". A 


Figure 6.12 is essentially the same as Fig. 6.11 but with the points labeled with the 
exponents on the generator. The operation on these exponents is addition modulo n. 


Subgroups of Finite Cyclic Groups 


We have completed our description of cyclic groups and turn to their subgroups. Corollary 
6.7 gives us complete information about subgroups of infinite cyclic groups. Let us give 
the basic theorem regarding generators of subgroups for the finite cyclic groups. 


Let G be acyclic group with n elements and generated by a. Let b ¢ G and let b = a* 
Then b generates a cyclic subgroup H of G containing n/d elements, where d is the 
greatest common divisor of ands. Also, (a°) = (a’) if and only if ged(s, n) = gcd(t, n). 


That b generates a cyclic subgroup H of G is known from Theorem 5.17. We need show 
only that H has n/d elements. Following the argument of Case II of Theorem 6.10, we 
see that H has as many elements as the smallest positive power m of b that gives the 
identity. Now b = a‘, and b” = e if and only if (a°)” = e, or if and only if n divides 
ms. What is the smallest positive integer m such that n divides ms? Let d be the ged of 
n ands. Then there exists integers u and v such that 


d=un+vs. 
Since d divides both n and s, we may write 
1 =u(n/d)+v(s/d) 


where both n/d and s/d are integers. This last equation shows that n/d and s/d are 
relatively prime, for any integer dividing both of them must also divide 1. We wish to 
find the smallest positive m such that 

ms  m(s/d). . 

— = ——— 1s an integer. 

n (n/d) 
From the boxed division property (1), we conclude that n/d must divide m, so the 
smallest such m is n/d. Thus the order of Hf is n/d. 

Taking for the moment Z,, as a model for a cyclic group of order n, we see that if d is 

a divisor of n, then the cyclic subgroup (d) of Z, had n/d elements, and contains all the 
positive integers m less than n such that gcd(m, n) = d. Thus there is only one subgroup 
of Z, of order n/d. Taken with the preceding paragraph, this shows at once that if 
a is a generator of the cyclic group G, then (a°) = (a‘) if and only if ged(s,n) = 
ged(t, n). . 


For an example using additive notation, consider Z12, with the generator a = 1. Since 
the greatest common divisor of 3 and 12 is 3, 3 = 3-1 generates a subgroup of 2 =4 
elements, namely 


(3) = {0, 3, 6, 9}. 


6.16 Corollary 


6.17 Example 
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Since the gcd of 8 and 12 is 4, 8 generates a subgroup of 2 = 3 elements, namely, 
{8} = {0, 4, 8}. 


Since the ged of 12 and 5 is 1, 5 generates a subgroup of i = 12 elements; that is, 5 is 
a generator of the whole group Z)2. A 


The following corollary follows immediately from Theorem 6.14. 


Tf a is a generator of a finite cyclic group G of order n, then the other generators of G 
are the elements of the form a", where r is relatively prime to n. 


Let us find all subgroups of Zig and give their subgroup diagram. All subgroups are 
cyclic. By Corollary 6.16, the elements 1, 5,7, 11, 13, and 17 are all generators of Zyg. 
Starting with 2, 


(2) = {0, 2, 4, 6, 8, 10, 12, 14, 16}. 


is of order 9 and has as generators elements of the form 2, where h is relatively prime 
to 9, namely, A = 1, 2,4, 5, 7, and 8, so A2 = 2, 4, 8, 10, 14, and 16. The element 6 of 
(2) generates {0, 6, 12}, and 12 also is a generator of this subgroup. 

We have thus far found all subgroups generated by 0, 1, 2, 4, 5, 6, 7, 8, 10, 11, 12, 
13, 14, 16, and 17. This leaves just 3, 9, and 15 to consider. 


(3) = {0, 3, 6, 9, 12, 15}, 


and 15 also generates this group of order 6, since 15 = 5 - 3, and the gcd of 5 and 6 is 1. 
Finally, 


(9) = {0, 9}. 


The subgroup diagram for these subgroups of Zjg is given in Fig. 6.18. 


4, 
agi 
ig 


6.18 Figure Subgroup diagram for Zj,. 


This example is straightforward; we are afraid we wrote it out in such detail that it 
may look complicated. The exercises give some practice along these lines. A 
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™ EXERCISES 6 


Computations 

In Exercises 1 through 4, find the quotient and remainder, according to the division algorithm, when n is divided 
by m. 

ln=42,.m=9 2.n=—-42,m =9 

3.n=-—50,m=8 4.n=50,m=8 


In Exercises 5 through 7, find the greatest common divisor of the two integers. 


§. 32 and 24 6. 48 and 88 7. 360 and 420 
In Exercises 8 through 11, find the number of generators of a cyclic group having the given order. 
8. 5 9. 8 10. 12 11. 60 


An isomorphism of a group with itself is an automorphism of the group. In Exercises 12 through 16, find the 
number of automorphisms of the given group. 
[Hint: Make use of Exercise 44. What must be the image of a generator under an automorphism?] 


12. Z 13. Ze 14. Zs 15. Z 16. Zi. 

In Exercises 17 through 21, find the number of elements in the indicated cyclic group. 

17. The cyclic subgroup of Z30 generated by 25 

18. The cyclic subgroup of Z42 generated by 30 

19. The cyclic subgroup (i) of the group C* of nonzero complex numbers under multiplication 
20. The cyclic subgroup of the group C* of Exercise 19 generated by (1 + 1) ee 

21. The cyclic subgroup of the group C* of Exercise 19 generated by 1 + i 


In Exercises 22 through 24, find all subgroups of the given group, and draw the subgroup diagram for the subgroups. 


22. Zi2 23. Z36 24. Zs 

In Exercises 25 through 29, find all orders of subgroups of the given group. 

25. Ze 26. Zs 27, Zy2 28. Zo 29. Zi7 
Concepts 


In Exercises 30 and 31, correct the definition of the italicized term without reference to the text, if correction is 
needed, so that it is in a form acceptable for publication. 

30. An clement a of a group G has order n € Z* if and only if a” =e. 

31. The greatest common divisor of two positive integers is the largest positive integer that divides both of them. 


32. Mark each of the following true or false. 


a. Every cyclic group is abclian. 

b. Every abelian group is cyclic. 

______ ¢. Qunder addition is a cyclic group. 

______ d.. Every element of every cyclic group generates the group. 
______ e. There is at least one abelian group of every finite order >0. 


f. Every group of order <4 is cyclic. 


Section6 Exercises 67 


g. All generators of Zy9 are prime numbers. 

h. If G and G’ are groups, then G NG’ is a group. 

i. If H and K are subgroups of a group G, then HM K is a group. 

j. Every cyclic group of order >2 has at least two distinct generators. 


In Exercises 33 through 37, either give an example of a group with the property described, or explain why no 
example exists. 


33. 
34. 
35. 
36. 
37. 


A finite group that is not cyclic 

An infinite group that is not cyclic 

Acyclic group having only one generator 

An infinite cyclic group having four generators 


A finite cyclic group having four generators 


The generators of the cyclic multiplicative group U,, of all nth roots of unity in C are the primitive nth roots of 
unity. In Exercises 38 through 41, find the primitive nth roots of unity for the given value of n. 


38. 
39. 
40. 
41. 


n=4 
n=6 
n=8 
n=12 


Proof Synopsis 


42. 
43. 


Give a one-sentence synopsis of the proof of Theorem 6.1. 


Give at most a three-sentence synopsis of the proof of Theorem 6.6. 


Theory 


44. 


45, 
46. 
47. 


48. 
49. 


50. 


51. 


Let G be a cyclic group with generator a, and let G’ be a group isomorphic to G. If 6: G > G’ is an 
isomorphism, show that, for every x € G, d(x) is completely determined by the value ¢(a). That is, if @: 
G — G'andw : G > G’ are two isomophisms such that ¢(a) = y(a), then 6(x) = w(x) for all x € G. 


Let r and s be positive integers. Show that {nr + ms | n,m € Z} is a subgroup of Z. 
Let a and b be elements of a group G. Show that if ab has finite order n, then ba also has order n. 


Let r and s be positive integers. 


‘a. Define the least common multiple of r and s as a generator of a certain cyclic group. 


b. Under what condition is the least common multiple of r and s their product, rs? 


ce. Generalizing part (b), show that the product of the greatest common divisor and of the least common multiple 
ofr ands isrs. 


Show that a group that has only a finite number of subgroups must be a finite group. 


Show by a counterexample that the following “converse” of Theorem 6.6 is not a theorem: “If a group G is 
such that every proper subgroup is cyclic, then G is cyclic.” 


Let G be a group and suppose a € G generates a cyclic subgroup of order 2 and is the unique such clement. 
Show that ax = xa for all x € G. [Hint: Consider (xax7!)*.] 


Let p and q be distinct prime numbers. Find the number of generators of the cyclic group Zp,. 
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52. 
53. 


54. 
55. 
56. 
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Let p be a prime number. Find the number of generators of the cyclic group Z,-, where r is an integer > 1. 


Show that in a finite cyclic group G of order n, written multiplicatively, the equation x” = e has exactly m 
solutions x in G for cach positive integer m that divides n. 


With reference to Exercise 53, what is the situation if 1 < _m <n and m does not divide n? 


Show that Zp has no proper nontrivial subgroups if p is a prime number. 


Let G be an abelian group and let H and K be finite cyclic subgroups with |H7| =r and|K|=s. 


a. Show that ifr and s are relatively prime, then G contains a cyclic subgroup of order rs. 
b. Generalizing part (a), show that G contains a cyclic subgroup of order the least common multiple of r and s. 


7.1 Example 


7.2 Example 


GENERATING SETS AND CAYLEY DIGRAPHS 


Let G be a group, and let a € G. We have described the cyclic subgroup (a) of G, which 
is the smallest subgroup of G that contains the element a. Suppose we want to find as 
small a subgroup as possible that contains both a and b for another element b in G. By 
Theorem 5.17, we sce that any subgroup containing @ and b must contain a” and b” for 
alim,n € Z, and consequently must contain all finite products of such powers of a and b. 
For example, such an expression might be a*b*a~*b*a°. Note that we cannot “simplify” 
this expression by writing first all powers of a followed by the powers of b, since G may 
not be abelian. However, products of such expressions are again expressions of the same 
type. Furthermore, e = @® and the inverse of such an expression is again of the same 
type. For example, the inverse of a*b4a~*b’a° is a>b~a3b—“‘a*. By Theorem 5.14, 
this shows that all such products of integral powers of a and b form a subgroup of G, 
which surely must be the smallest subgroup containing both a and b. We call a and b 
generators of this subgroup. If this subgroup should be all of G, then we say that {a, b} 
generates G. Of course, there is nothing sacred about taking just two elements a, b € G. 
We could have made similar arguments for three, four, or any number of elements of G, 
as long as we take only finite products of their integral powers. 


The Klein 4-group V = {e, a, b, c} of Example 5.9 is generated by {a, b} since ab = c. 
Itis also generated by {a, c}, {b, c}, and {a, b, c}. Ifa group G is generated by a subset S, 
then every subset of G containing S generates GC. A 


The group Ze is generated by {1} and {5}. It is also generated by {2, 3} since 2 + 3 =5, 
so that any subgroup containing 2 and 3 must contain 5 and must therefore be Ze. It is 
also generated by {3, 4}, {2, 3, 4}, {1, 3}. and {3, 5}, but it is not generated by {2, 4} 
since (2) = {0, 2, 4} contains 2 and 4. A 


We have given an intuitive explanation of the subgroup of a group G generated by 
a subset of G. What follows is a detailed exposition of the same idea approached in 
another way, namely via intersections of subgroups. After we get an intuitive grasp of 
a concept, it is nice to try to write it up as neatly as possible. We give a set-theoretic 
definition and generalize a theorem that was in Exercise 54 of Section 5. 


7.3 Definition 


7.4 Theorem 


Proof 


7.5 Definition 


7.6 Theorem 


Proof 
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Let {5; |i € 7} be acollection of sets. Here J may be any set of indices. The intersection 
Nie S; of the sets 5S; is the set of all elements that are in all the sets S;; that is, 


Si = {x |x € S; for alli € I}. 
te 


If / is finite, J = {1,2,...,n}, we may denote N;<,5; by 
Si, M820+++ A Sy. | 


The intersection of some subgroups H; of a group G fori € J is again a subgroup of G. 


Let us show closure. Let a € Njc;H; and b € Nje;H;, so that a € A; for alli € J and 
b € H, for alli ¢ I. Thenab € H; for alli € J, since H; is a group. Thus ab € Nje7 Hj. 
Since H; is a subgroup for all i € J, we have e € H; for all i ¢ J, and hence 
e€ Mer. 
Finally, for a € M;¢;H;, we have a € Hj for alli e J, so a! €H, forallie I, 
which implies that a~! € Ne; Hj. 


Let G be a group and let a; € G for i € J. There is at least one subgroup of G 
containing all the elements a; fori € 7, namely G is itself. Theorem 7.4 assures us that 
if we take the intersection of all subgroups of G containing all a; fori € J, we will obtain 
a subgroup H of G. This subgroup H is the smallest subgroup of G containing all the 
a; fori € I. 


Let G be a group and let a; € G for i € J. The smallest subgroup of G containing 
{a; |i € I} is the subgroup generated by {a; |i € J}. If this subgroup is all of G, then 
{a; |i € I} generates G and the a; are generators of G. If there is a finite set {a; |i € I} 
that generates G, then G is finitely generated. | 


Note that this definition is consistent with our previous definition of a generator for 
acyclic group. Note also that the statement a is a generator of G may mean either that 
G = (a) or that a is a member of a subset of G that generates G. The context in which 
the statement is made should indicate which is intended. Our next theorem gives the 
structural insight into the subgroup of G generated by {a; |i € Z} that we discussed for 
two generators before Example 7.1. 


If G is a group and a; € G fori € J, then the subgroup H of G generated by {a; |i € I} 
has as elements precisely those elements of G that are finite products of integral powers 
of the a;, where powers of a fixed a; may occur several times in the product. 


Let K denote the set of all finite products of integral powers of the a;. Then K C H. 
We need only observe that K is a subgroup and then, since H is the smallest subgroup 
containing a; for i € J, we will be done. Observe that a product of elements in K is 
again in K. Since (a;)° = e, we have e € K. For every element k in K, if we form from 
the product giving k a new product with the order of the a; reversed and the opposite 
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sign on all exponents, we have k-1, which is thus in K. For example, 


[a @yray P= Cay)" @2) ay, 
which is again in K. ¢ 


Cayley Digraphs 


For each generating set S of a finite group G, there is a directed graph representing the 
group in terms of the generators in §. The term directed graph is usually abbreviated as 
digraph. These visual representations of groups were devised by Cayley, and are also 
referred to as Cayley diagrams in the literature. 

Intuitively, a digraph consists of a finite number of points, called vertices of the 
digraph, and some ares (each with a direction denoted by an arrowhead) joining vertices. 
In a digraph for a group G using a generating set S we have one vertex, represented by 
a dot, for each element of G. Each generator in S is denoted by one type of arc. We 
could use different colors for different arc types in pencil and paperwork. Since different 
colors are not available in our text, we use different style arcs, like solid, dashed, and 
dotted, to denote different generators. Thus if S = {a, b, c} we might denote 


a by ——>—_-, bby ---»>----; and c DY Pees 


With this notation, an occurrence of xe——>——*y in a Cayley digraph means that 
xa — y. Thatis, traveling an arc in the direction of the arrow indicates that multiplication 
of the group element at the start of the arc on the right by the generator corresponding 
to that type of arc yields the group element at the end of the arc. Of course, since 
we are in a group, we know immediately that ya~' = x. Thus traveling an are in the 
direction opposite to the arrow corresponds to multiplication on the right by the inverse 
of the corresponding generator. If a generator in S is its own inverse, it is customary to 
denote this by omitting the arrowhead from the arc, rather than using a double arrow. 
For example, if b> = e, we might denote b by ------—-. 


Both of the digraphs shown in Fig. 7.8 represent the group Ze with generating set $ = {1}. 
Neither the length and shape of an arc nor the angle between arcs has any significance. 


A 
0 
5 1 0 1 2 3 
4 2 
4 
3 5 
(a) (b) 


7.8 Figure Two digraphs for Zp with S = {1} using ae a 
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0 
0 3 A 
4 1 4 7) 
(a) (b) 
7.9 Figure Two digraphs for Z, with S = {2, 3} using SS and — —— aut 


Both of the digraphs shown in Fig. 7.9 represent the group Z¢ with generating set S = 
{2, 3}. Since 3 is its own inverse, there is no arrowhead on the dashed arcs representing 3. 
Notice how different these Cayley diagrams look from those in Fig. 7.8 for the same 
group. The difference is due to the different choice for the set of generators. A 


Every digraph for a group must satisfy these four properties for the reasons indicated. 


Property Reason 
1. The digraph is connected, that is, Every equation gx = / has a solution 
we can get from any vertex g to in a group. 


any vertex / by traveling along 
consecutive arcs, starting at g and 


ending at h. 

2. At most one arc goes from a vertex The solution of gx = h is unique. 
g to a vertex h. 

3. Each vertex g has exactly one arc For g € G and each generator b we 
of each type starting at g, and one can compute gb, and (gb~')b = g. 
of each type ending at g. 

4. If two different sequences of arc If gg =h and gr =h, then ug = 
types starting from vertex g lead ug th = ur. 


to the same vertex h, then those 
same sequences of arc types starting 
from any vertex u will lead to 

the same vertex v. 


It can be shown that, conversely, every digraph satisfying these four properties is a Cayley 
digraph for some group. Due to the symmetry of such a digraph, we can choose labels 
like a, b,c for the various arc types, name any vertex e to represent the identity, and 
name each other vertex by a product of arc labels and their inverses that we can travel 
to attain that vertex starting from the one that we named e. Some finite groups were first 
constructed (found) using digraphs. 
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(a) (b) 
7.11 Figure 


7.12 Example A digraph satisfying the four properties on page 71 is shown in Fig. 7.11 (a). To obtain 
Fig. 7.11 (b), we selected the labels 


named a vertex e, and then named the other vertices as shown. We have a group 
{e,a, a’, a3, b, ab, a7b, a>b} of eight elements. Note that the vertex that we named 
ab could equally well be named ba™', the vertex that we named a? could be named aw, 
etc. It is not hard to compute products of elements in this group. To compute (a>b)\(a"b), 
we just start at the vertex labeled a’b and then travel in succession two solid arcs and 
one dashed arc, arriving at the vertex a, so (a°b)(a*b) = a. In this fashion, we could 
write out the table for this eight-element group. A 


@ EXERCISES 7 


Computations 


In Exercises 1 through 6, list the elements of the subgroup generated by the given subset. 


1. The subset {2, 3} of Zy2 2. The subset {4, 6} of Zi. 

3. The subset {8, 10} of Zis 4, The subset {12, 30} of Z36 

5, The subset {12, 42} of Z 6. The subset {18, 24, 39} of Z 
7. For the group described in Example 7.12 compute these products, using Fig. 7.11(b). 


a. (a2b)a3 b. (ab)(a*b) c. b(a*b) 


€ a 
—— 
€ -——— 74 Ge \ 
' t ] i 
b ¢ 
| 
b e+e snasas 
d f 


(a) (b) (c) 
7.13 Figure 
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In Exercises 8 through 10, give the table for the group having the indicated digraph. In each digraph, take e as 
identity element. List the identity e first in your table, and list the remaining elements alphabetically, so that 
your answers will be easy to check. 


8. The digraph in Fig. 7.13(a) 
9, The digraph in Fig. 7.13(b) 

10. The digraph in Fig. 7.13(c) 

Concepts 

11. How can we tell from a Cayley digraph whether or not the corresponding group is commutative? 

12. Referring to Exercise 11, determine whether the group corresponding to the Cayley digraph in Fig. 7.11(b) is 
commutative. 

13. Is it obvious from a Cayley digraph of a group whether or not the group is cyclic? [Hint: Look at Fig. 7.9(b).] 

14. The large outside triangle in Fig. 7.9(b) exhibits the cyclic subgroup {0, 2, 4} of Zs. Does the smaller inside 
triangle similarly exhibit a cyclic subgroup of Z,? Why or why not? 

15. The generating set S = {1,2} for Ze contains more generators than necessary, since 1 is a generator for the 
group. Nevertheless, we can draw a Cayley digraph for Z, with this generating set S. Draw such a Cayley 
digraph. 

16. Draw a Cayley digraph for Zg taking as generating set S = {2, 5}. 

17, A relation on a set S of generators of a group G is an equation that equates some product of generators and 
their inverses to the identity e of G. For example, if S = {a, b} and G is commutative so that ab = ba, then 
one relation is aba~'b~! = e. If, moreover, b is its own inverse, then another relation is b? = e. 

a. Explain how we can find some relations on S from a Cayley digraph of G. 
b. Find three relations on the set § = {a, b} of generators for the group described by Fig. 7.11(b). 

18. Draw digraphs of the two possible structurally different groups of order 4, taking as small a generating set as 
possible in each case. You need not label vertices. 

Theory 

19, Show that for n > 3, there exists a nonabelian group with 2n elements that is generated by two elements of 


order 2. 


Permutations, Cosets, 
and Direct Products 


Section 8 | Groups of Permutations 

Section 9 Orbits, Cycles, and the Alternating Groups 

Section 10 Cosets and the Theorem of Lagrange 

Section 11 Direct Products and Finitely Generated Abelian Groups 
Section 12 ‘Plane lsometries 


GROUPS OF PERMUTATIONS 


We have seen examples of groups of numbers, like the groups Z, Q, and R under 
addition. We have also introduced groups of matrices, like the group GL(2, R). Each 
element A of GL(2, R) yields a transformation of the plane R? into itself; namely, if we 
regard x as a 2-component column vector, then Ax is also a 2-component column vector. 
The group GL(2, R) is typical of many of the most useful groups in that its elements 
act on things to transform them. Often, an action produced by a group element can be 
regarded as a function, and the binary operation of the group can be regarded as function 
composition. In this section, we construct some finite groups whose elements, called 
permutations, act on finite sets. These groups will provide us with examples of finite 
nonabelian groups. We shall show that any finite group is structurally the same as some 
group of permutations. Unfortunately, this result, which sounds very powerful, does not 
turn out to be particularly useful to us. 

You may be familiar with the notion of a permutation of a set as arearrangement of the 
elements of the set. Thus for the set {1, 2, 3, 4, 5}, a rearrangement of the elements could 
be given schematically as in Fig. 8.1, resulting in the new arrangement {4, 2, 5, 3, 1}. 
Let us think of this schematic diagram in Fig. 8.1 as a function mapping of each element 
listed in the left column into a single (not necessarily different) element from the same 
set listed at the right. Thus 1! is carried into 4, 2 is mapped into 2, and so on. Furthermore, 
to be a permutation of the set, this mapping must be such that each element appears in 
the right column once and only once. For example, the diagram in Fig. 8.2 does not give 
a permutation, for 3 appears twice while | does not appear at all in the right column. We 
now define a permutation to be such a mapping. 


t Section 12 is not used in the remainder of the text. 
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8.4 Example 
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14 133 
232 22 
35 34 
433 435 
51 533 


8.1 Figure 8.2 Figure 


A permutation of a set A is a function ¢ : A — A thatis both one toone andonto. 


Permutation Groups 


We now show that function composition o is a binary operation on the collection of all 

permutations of a set A. We call this operation permutation multiplication. Let Abea 

set, and let o and r be permutations of A so that o and t are both one-to-one functions 

mapping A onto A. The composite function o o t defined schematically by 
ASA+A, 


gives a mapping of A into A. Rather than keep the symbol o for permutation multiplica- 
tion, we will denote o o t by the juxtaposition ot, as we have done for general groups. 
Now or will be a permutation if it is one to one and onto A. Remember that the action 
of ot on A must be read in right-to-left order: first apply t and then o. Let us show that 
ot is one to one. If 


(ot)(ay) = (oT)(a2), 
then 
a(t(a1)) = o(t(a2)), 


and since o is given to be one to one, we know that t(a@,) = T(a2). But then, since T is 
one to one, this gives aj = a2. Hence ot is one to one. To show that or is onto A, let 
a <A. Since o is onto A, there exists a’ ¢ A such that o(a’) =a. Since t is onto A, 
there exists a” € A such that t(a”) = a’. Thus 


a =o(a') = a(t") = (ot)@"), 


so oT is onto A. 


Suppose that 
A= {l, 2,3, 4, 5} 


and that o is the permutation given by Fig. 8.1. We write o in a more standard notation, 
changing the columns to rows in parentheses and omitting the arrows, as 


fl @ B-4 Ss 
"=\4 2 5 3 17? 
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@ HistoricaL Note 


Oz of the earliest recorded studies of permu- 
tations occurs in the Sefer Yetsirah, or Book 
of Creation, written by an unknown Jewish author 
sometime before the eighth century. The author was 
interested in counting the various ways in which 
the letters of the Hebrew alphabet can be arranged. 
The question was in some sense a mystical one. 
It was believed that the letters had magical pow- 
ers; therefore, suitable arrangements could subju- 
gate the forces of nature. The actual text of the 
Sefer Yetsirah is very sparse: “Two letters build 
two words, three build six words, four build 24 
words, five build 120, six build 720, seven build 
5040.” Interestingly enough, the idea of counting 
the arrangements of the letters of the alphabet also 
occurred in Islamic mathematics in the eighth and 
ninth centuries. By the thirteenth century, in both 
the Islamic and Hebrew cultures, the abstract idea 
of a permutation had taken root so that both Abu-l-’ 


Abbas ibn al-Banna (1256-1321), a mathematician 
from Marrakech in what is now Morocco, and Levi 
ben Gerson, a French rabbi, philosopher, and math- 
ematician, were able to give rigorous proofs that the 
number of permutations of any set of n elements is 
n!, as well as prove various results about counting 
combinations. 

Levi and his predecessors, however, were con- 
cerned with permutations as simply arrangements of 
a given finite set. It was the search for solutions of 
polynomial equations that led Lagrange and others 
in the late eighteenth century to think of permuta- 
tions as functions from a finite set to itself, the set 
being that of the roots of a given equation. And it 
was Augustin-Louis Cauchy (1789-1857) who de- 
veloped in detail the basic theorems of permutation 
theory and who introduced the standard notation 
used in this text. 


so that o(1) = 4, o (2) = 2, and so on. Let 


Then 


2 3 4 5 


1 
or= (4 23 3 


123 4 5\)_/f123 45 
35 AR Ap AS Lh BB eay 


For example, multiplying in right-to-left order, 
(ot)(1) = o(t(1)) = 93) =5. 
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A 


We now show that the collection of all permutations of a nonempty set A forms a 
group under this permutation multiplication. 


8.5 Theorem 


is a group under permutation multiplication. 


Proof 


so S4 is closed under permutation multiplication. 
Now permutation multiplication is defined as function composition, and in Section 2, 
we showed that function composition is associative. Hence &, is satisfied. 
The permutation ¢ such that «(a) = a, for all a € A acts as identity. Therefore & is 


satisfied. 


Let A be a nonempty set, and let S'4 be the collection of all permutations of A. Then S, 


We have shown that composition of two permutations of A yields a permutation of A, 
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8.6 Definition 


8.7 Example 


Permutations, Cosets, and Direct Products 


For a permutation o, the inverse function, o —| is the permutation that reverses the 
direction of the mapping o, that is, o~'(a) is the elementa’ of A such that a = o(a’). The 
existence of exactly one such element a’ is a consequence of the fact that, as a function, 
o is both one to one and onto. For each a € A we have 


(a) =a =a(a’) = o(0 a) = (oo 'Nfa) 
and also 
ua’) =a' =o7Ma)=o 'ala’)) = (a oa), 
so that o~!o0 and oo 7! are both the permutation 1. Thus G; is satisfied. ea 


Warning: Some texts compute a product o jz of permutations in left-to-right order, so 
that (o 1)(a) = (o(a)). Thus the permutation they get for oy is the one we would get 
by computing jzo. Exercise 51 asks us to check in two ways that we still get a group. 
If you refer to another text on this material, be sure to check its order for permutation 
multiplication. 


There was nothing in our definition of a permutation to require that the set A be 
finite. However, most of our examples of permutation groups will be concerned with 
permutations of finite sets. Note that the structure of the group S4 is concemed. only 
with the number of elements in the set A, and not what the elements in A are. If sets A and 
B have the same cardinality, then S4 ~ Sg. To define an isomorphism @:S4 — Sz, we 
let f : A > B be a one-to-one function mapping A onto B, which establishes that A and 
B have the same cardinality. Fora € S4, we let o(o) be the permutation € Sg such that 
6(f(a)) = f(o(a)) for all a € A. To illustrate this for A = {1, 2, 3} and B= {#, $, %} 
and the function f: A — B defined as 


fHD=#, fQ=$, FO=%, 


i 23\ 2 oft 3% 
Be oe gy BNO Nee ee, ae) 


We simply rename the elements of A in our two-row notation by elements in B using 
the renaming function f, thus renaming elements of S, to be those of Sg. We can take 
{1, 2, 3,-++, n} to be a prototype for a finite set A of n elements. 


@ maps 


Let A be the finite set {1, 2, ---,n}. The group of all permutations of A is the symmetric 
group on n letters, and is denoted by S,,. | 


Note that S,, has n! elements, where 
nt =n(n — Din — 2)--- BZ). 


Two Important Examples 


An interesting example for us is the group 5; of 3! = 6 elements. Let the set A be {1, 2,3}. 
We list the permutations of A and assign to each a subscripted Greek letter for a name. 


3 
2 
8.9 Figure 
8.10 Example 
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The reasons for the choice of names will be clear later. Let 


_f1 2 3 _ ft 
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8.8 Table 


The multiplication table for S3 is shown in Table 8.8. Note that this group is not abelian! 
We have seen that any group of at most 4 elements is abelian. Later we will see that 
a group of 5 elements is also abelian. Thus $3; has minimum order for any nonabelian 
group. A 


There is a natural correspondence between the elements of $3 in Example 8.7 and the 
ways in which two copies of an equilateral triangle with vertices 1, 2, and 3 (see Fig. 8.9 
can be placed, one covering the other with vertices on top of vertices. For this reason, 
S3 is also the group D3 of symmetries of an equilateral triangle. Naively, we used p; 
for rotations and j4; for mirror images in bisectors of angles. The notation D3 stands for 
the third dihedral group. The nth dihedral group D,, is the group of symmetries of the 
regular n-gon. See Exercise 44.1 

Note that we can consider the elements of S3 to act on the triangle in Fig. 8.9. See 
the discussion at the start of this section. 


Let us form the dihedral group D4 of permutations corresponding to the ways that two 
copies of a square with vertices 1, 2, 3, and 4 can be placed, one covering the other with 
vertices on top of vertices (see Fig. 8.11). D4 will then be the group of symmetries 
of the square. It is also called the octic group. Again, we choose seemingly arbitrary 


* Many people denote the nth dihedral group by D2, rather than by D,, since the order of the group is 2n. 
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4 3 notation that we shall explain later. Naively, we are using p; for rotations, 4; for mirror 
images in perpendicular bisectors of sides, and 4; for diagonal flips. There are eight 
permutations involved here. Let 


Hi a oe ee 
ROS NA 29g AN PLANO At de oe 
1 oe) 
: ae ee ae it oe 7 A 
8.11 Figure = = 
: a=(j 34 ys ({ 422 DE 
Lt Bo BA p(s 
a ce ae a PNG oO A? 
fl Bes ae ee ce 
RSS Ny oy 9 BY? 2=\1 4 3 2)° 
8.12 Table 
{Po Pr Hi» Ha} {Po Pi> 2 3} a \ 
(Po: wy} {00> eo} {Po, po} {Po; 5,} {Po. dy} 


{Po} 


8.13 Figure Subgroup diagram for Dy. 
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The table for D4 is given in Table 8.12. Note that D4 is again nonabelian. This group 
is simply beautiful. It will provide us with nice examples for many concepts we will 
introduce in group theory. Look at the lovely symmetries in that table! Finally, we give 
in Fig. 8.13 the subgroup diagram for the subgroups of D4. Look at the lovely symmetries 
in that diagram! A 


Cayley’s Theorem 


Look at any group table in the text. Note how each row of the table gives a permutation 
of the set of elements of the group, as listed at the top of the table. Similarly, each column 
of the table gives a permutation of the group set, as listed at the left of the table. In view 
of these observations, it is not surprising that at least every finite group G is isomorphic 
to a subgroup of the group Sg of all permutations of G. The same is true for infinite 
groups; Cayley’s theorem states that every group is isomorphic to some group consisting 
of permutations under permutation multiplication. This is a nice and intriguing result, 
and is a classic of group theory. At first glance, the theorem might seem to be a tool to 
answer ail questions about groups. What it really shows is the generality of groups of 
permutations. Examining subgroups of all permutation groups S, for sets A of all sizes 
would be a tremendous task. Cayley’s theorem does show that if a counterexample exists 
to some conjecture we have made about groups, then some group of permutations will 
provide the counterexample. 

We now proceed to the proof of Cayley’s theorem, starting with a definition and 


then a lemma that is important in its own right. 


#@ HIstTorIcAL NOTE 


ACS Cayley (1821-1895) gave an abstract- 
sounding definition of a group in a paper of 
1854: “A set of symbols, 1, a, 8,---, all of them 
different and such that the product of any two of 
them (no matter in what order) or the product of 
any one of them into itself, belongs to the set, is 
said to be a group.” He then proceeded to define a 
group table and note that every line and column of 
the table “will contain all the symbols 1, a, B,---.” 
Cayley’s symbols, however, always represented op- 
erations on sets; it does not seem that he was aware 
of any other kind of group. He noted, for instance, 
that the four matrix operations 1,a@ = inversion, 
B= transposition, and y = aw, form, abstractly, 
the non-cyclic group of four elements. In any case, 
his definition went unnoticed for a quarter of a 
century. 

This paper of 1854 was one of about 300 written 
during the 14 years Cayley was practicing law, being 


unable to find a suitable teaching post. In 1863, he 
finally became a professor at Cambridge. In 1878, 
he returned to the theory of groups by publishing 
four papers, in one of which he stated Theorem 8.16 
of this text; his “proof” was simply to notice from 
the group table that multiplication by any group el- 
ement permuted the group elements. However, he 
wrote, “this does not in any wise show that the best 
or the easiest mode of treating the general problem 
{of finding all groups of a given order] is thus to 
regard it as a problem of [permutations]. It seems 
clear that the better course is to consider the general 
problem in itself.” 

The papers of 1878, unlike the earlier one, 
found a receptive audience; in fact, they were an 
important influence on Walter Van Dyck’s 1882 ax- 
iomatic definition of an abstract group, the defini- 
tion that led to the development of abstract group 
theory. 


se et 
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8.14 Definition 


8.15 Lemma 


Proof 


8.16 Theorem 


Proof 


Permutations, Cosets, and Direct Products 


Let f : A — B bea function and let H be a subset of A. The image of H under f is 
{f(h)|h € H} and is denoted by f[H]. | 


Let G and G’ be groups and let @ : G > G’ be a one-to-one function such that @(xy) = 
$(x)@(y) for allx, y € G. Then @[G] is a subgroup of G’ and ¢ provides an isomorphism 
of G with @[G]. 


We show the conditions for a subgroup given in Theorem 8.14 are satisfied by ¢[G]. 
Let x’, y’ © @[G]. Then there exist x, y € G such that #(%) = x’ and o(y) = y’. By 
hypothesis, d(cy) = 6(x)@(y) = x'y’, showing that x’y’ € o[G]. We have shown that 
¢[G] is closed under the operation of G’. 

Let e’ be the identity of G’. Then 


e'P(e) = ble) = b(ee) = HE)PLE). 


Cancellation in G’ shows that e’ = ¢(e) so e’ € ¢[G]. 
For x’ € é[G] where x’ = o(x), we have 


é! = be) = b(xx |) = b(@)O(07') = x G(x), 


which shows that x’~! = @(x7~!) € @[G]. This completes the demonstration that ¢[G] 

is a subgroup of G’. 
That @ provides an isomorphism of G with @[G] now follows at once because @ 
provides a one-to-one map of G onto @[G] such that (xy) = $(x)(y) for all x, y € G. 
Aa 


(Cayley’s Theorem) Every group is isomorphic to a group of permutations. 


Let G be a group. We show that G is isomorphic to a subgroup of Sg. By Lemma 8.15, we 
need only to define a one-to-one function ¢ : G > Sg such that @(xy) = o(4)b()) for all 
x, y € G.Forx € G, leta, : G > G be defined by A,(g) = xg forall g € G. (We think 
of Ax as performing left multiplication by x.) The equation A,(x71c) = x(a~'c) = ¢ for 
all c € G shows that 2, maps G onto G. If A,(a@) = A,(b), then xa = xb soa =b by 
cancellation. Thus 2, is also one to one, and is a permutation of G. We now define 
¢:G— Sg by defining P(x) = A, forallx ¢ G. 

To show that ¢ is one to one, suppose that (7) = @(y). Then A, = Ay as functions 
mapping G into G. In particular 4,(¢) = Ay(e), so xe = ye and x = y. Thus ¢ is one to 
one. It only remains to show that @(xy) = @(«)o(y), thatis, thata,y = AzAy. Now for any 
g € G, we have A,,(g) = (xy)g. Permutation multiplication is function composition, so 
(AxAy)(2) = Ax(Ay(g)) = Ax(yg) = xg). Thus by associativity, Axy = AxAy. ° 


For the proof of the theorem, we could have considered equally well the permutations 
px of G defined by 


Px(g) = Bx 


for g € G. (We can think of p, as meaning right multiplication by x.) Exercise 52 shows 
that these permutations form a subgroup of S¢, again isomorphic to G, but provided by 
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amap wu: G > Sg defined by 
HX) = Py. 


8.17 Definition The map ¢ in the proof of Theorem 8.16 is the left regular representation of G, and 
the map yz in the preceding comment is the right regular representation of G. a 


8.18 Example Let us compute the left regular representation of the group given by the group table, 
Table 8.19. By “compute” we mean give the elements for the left regular representation 
and the group table. Here the elements are 


ea b : e a b e a b 
m=’ a Ne oat b at and n= (5 e sy. 


The table for this representation is just like the original table with x renamed 1,, as seen 
in Table 8.20. For example, 


For a finite group given by a group table, p, is the permutation of the elements 
corresponding to their order in the column under a at the very top, and A, is the permu- 
tation corresponding to the order of the elements in the row opposite a at the extreme 
left. The notations o, and A, were chosen to suggest right and left multiplication by a, 
respectively. 


@ EXERCISES 8 


Computation 
In Exercises 1 through 5, compute the indicated product involving the following permutations in S¢: 
_f{1 2 3 4 5 6 _f1 23 4 5 6 _f1 2 3 4 5 6 
T=\3 14 5 6 2)’ Ree Atl chose. 5)? cae tan ae Ag 
lL. to 2. to 3. wo? 4. 0?*7 5. 0 'to 


In Exercises 6 through 9, compute the expressions shown for the permutations o, t and yu defined prior to Exercise 1. 


6. |(c)| 7. |(r?)| 8. '° 9, 1 
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Partition the following collection of groups into subcollections of isomorphic groups. Here a * superscript means 
all nonzero elements of the set. 


Z under addition AY 


Le R* under multiplication 
Zo R™ under multiplication 
S6 Q* under multiplication 


17Z under addition C* under multiplication 
Q under addition The subgroup (zr) of R* under multiplication 


3Z under addition The subgroup G of S; generated by G : : : :) 


R under addition 


Let A be a set andlet o € S,. Fora fixed a € A, the set 


Oao = {o"(a)|n € Z} 


is the orbit of a under c. In Exercises 11 through 13, find the orbit of 1 under the permutation defined prior to 


Exercise 1. 
ll. o 12. t 13. uw 
14. In Table 8.8, we used po, 01, 02; Li, (42, 43 as the names of the 6 clements of S;. Some authors use the notations 


15. 
16. 
17. 
18. 


19. 


20. 


21, 


€, 2, P's G, pd, po for these elements, where their ¢ is our identity o, their p is our o, and their ¢ is our /41. 
Verify geometrically that their six expressions do give all of S3. 


With reference to Exercise 14, give a similar alternative labeling for the 8 elements of Dg in Table 8.12. 
Find the number of elements in the set {o € 54|o(3) = 3}. 

Find the number of elements in the set {0 € Ss |o(2) = 5}. 

Consider the group $3 of Example 8.7 

a. Find the cyclic subgroups (1), (92), and (1;) of 53. 

b. Find all subgroups, proper and improper, of S3 and give the subgroup diagram for them. 


Verify that the subgroup diagram for D, shown in Fig. 8.13 is correct by finding all (cyclic) subgroups generated 
by one element, then all subgroups generated by two elements, etc. 


Give the multiplication table for the cyclic subgroup of Ss generated by 


There will be six elements. Let them be p, p”, p°, p*, p°, and p° = p’. Is this group isomorphic to $3? 


a. Verify that the six matrices 
1 0 0 0 1 0 00 1 1 0 0 0 0 ft] {0 1 =O 
01 0/,10 0 1],]/1 0 O}],;0 0 1],}/0 1 OF,J1 0 0 
00 1 100 0 1 0 0 1 0 1 0 0 00 1 
form a group under matrix multiplication. [Hint: Don’t try to compute all products of these matrices. Instead, 
1 
think how the column vector | 2 | is transformed by multiplying it on the left by each of the matrices.] 
3 


b. What group discussed in this section is isomorphic to this group of six matrices? 
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(a) (b) (c) 


(Consider this part to continue infinitely to the left and right.) 
(d) 


8.21 Figure 
22. After working Exercise 21, write down eight matrices that form a group under matrix multiplication that is 
isomorphic to Da. 


In this section we discussed the group of symmetries of an equilateral triangle and of a square. In Exercises 23 
through 26, give a group that we have discussed in the text that is isomorphic to the group of symmetries of the 
indicated figure. You may want to label some special points on the figure, write some permutations corresponding 
to symmetries, and compute some products of permutations. 


23. The figure in Fig. 8.21 (a) 24. The figure in Fig. 8.21 (b) 

25. The figure in Fig. 8.21 (c) 26. The figure in Fig. 8.21 (d) 

27. Compute the left regular representation of Z,. Compute the right regular representation of $3 using the notation 
of Example 8.7. 

Concepts 


In Exercises 28 and 29, correct the definition of the italicized term without reference to the text, if correction is 
needed, so that it is in a form acceptable for publication. 


28. A permutation of a set S is a one-to-one map from S to S. 


29, The left regular representation of a group G is the map of G into Sg whose value at g € G is the permutation 
of G that carries each x € G into gx. 


In Exercises 30 through 34, determine whether the given function is a permutation of R. 
30. fi :R— R defined by fix) =x+4+1 

31. fo: R > R defined by fo(x) = x? 

32. f3 : IR > R defined by f3(x) = —x° 

33. fs: R —- R defined by f,(x) = e* 

34. fs : IR — R defined by fs(x) = x? — x7 — 2x 

35. Mark each of the following true or false. 


a. Every permutation is a one-to-one function. 
b. Every function is a permutation if and only if it is one to one. 
c. Every function from a finite set onto itself must be one to one. 


d. Every group G is isomorphic to a subgroup of Sg. 
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e. Every subgroup of an abelian group is abelian. 

f. Every element of a group generates a cyclic subgroup of the group. 
g. The symmetric group 59 has 10 elements. 

h. The symmetric group $3 is cyclic. 


i. S, is not cyclic for any n. 


j. Every group is isomorphic to some group of permutations. 
36. Show by an example that every proper subgroup of a nonabelian group may be abelian. 


37. Let A be a nonempty set. What type of algebraic structure mentioned previously in the text is given by the set 
of all functions mapping A into itself under function composition? 


38. Indicate schematically a Cayley digraph for D, using a generating set consisting of a rotation through 27/n 
radians and a reflection (mirror image). See Exercise 44. 


Proof Synopsis 


39, Give a two-sentence synopsis of the proof of Cayley’s theorem. 


Theory 


In Exercises 40 through 43, let A be a set, B a subset of A, and let b be one particular element of B. Determine 
whether the given set is sure to be a subgroup of S4 under the induced operation. Here o[B] = {o(x)|x € B}. 
40. {o € S,|a(b) = 5} 41. {o € S,|o(b) € B} 

42. {o € S4lo[B] C B} 43. {o € S4|o[B] = B} 

44, In analogy with Examples 8.7 and 8.10, consider a regular plane n-gon for n > 3. Each way that two copies of 
such an n-gon can be placed, with one covering the other, corresponds to a certain permutation of the vertices. 
The set of these permutations in a group, the nth dihedral group D,,, under permutation multiplication. Find 
the order of this group D,. Argue geometrically that this group has a subgroup having just haif as many elements 
as the whole group has. 


45. Consider a cube that exactly fills a certain cubical box. As in Examples 8.7 and 8.10, the ways in which the 
cube can be placed into the box correspond to a certain group of permutations of the vertices of the cube. This 
group is the group of rigid motions (or rotations) of the cube. (It should not be confused with the group of 
symmetries of the figure, which will be discussed in the exercises of Section 12.) How many elements does this 
group have? Argue geometrically that this group has at least three different subgroups of order 4 and at least 
four different subgroups of order 3. 


46. Show that S,, is a nonabelian group for n > 3. 


47. Strengthening Exercise 46, show that if n > 3, then the only element of o of S, satisfying oy = yo for all 
y € S, iso =, the identity permutation. 

48. Orbits were defined before Exercise 11. Leta, b € A anda € S4. Show that if O,,, and Op,, have an element 
in common, then Og.¢ = Op.c- 

49, If A is a set, then a subgroup H of Sy, is transitive on A if for each a, b € A there exists o € H such that 
o(a) = b. Show that if A is a nonempty finite set, then there exists a finite cyclic subgroup A of Sq with 
|H| = |A| that is transitive on A. 

50. Referring to the definition before Exercise 11 and to Exercise 49, show that fora € Sq, {c) is transitive on A 
if and only if O,,, = A forsomea € A. 

51. (See the warning on page 78). Let G be a group with binary operation *. Let G’ be the same set as G, and 
define a binary operation *’ on G’ by x x’ y = y *x forallx, y < G’. 
a. (Intuitive argument that G’ under x’ is a group.) Suppose the front wall of your class room were made 

of transparent glass, and that all possible products a*b=c and all possible instances a * (b*c) = 
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(a x b) « c of the associative property for G under *« were written on the wall with a magic marker. What 
would a person see when looking at the other side of the wall from the next room in front of yours? 


b. Show from the mathematical definition of *' that G’ is a group under ¥’. 


52. Let G be a group. Prove that the permutations p, :G — G, where p,(x) = xa fora € G and x € G, do form 
a group isomorphic to G. 
53. A permutation matrix is one that can be obtained from an identity matrix by reordering its rows. If P is an 


n X n permutation matrix and A is any n x n matrix and C = PA, then C can be obtained from A by making 
precisely the same reordering of the rows of A as the reordering of the rows which produced P from J,. 


a. Show that every finite group of order n is isomorphic to a group consisting of n x n permutation matrices 
under matrix multiplication. 

b. For each of the four elements e, a, b, and c in the Table 5.11 for the group V, give a specific 4 x 4 matrix 
that corresponds to it under such an isomorphism. 


Orsits, CYCLES, AND THE ALTERNATING GROUPS 
Orbits 


Each permutation o of a set A determines a natural partition of A into cells with the 
property that a, b € A are in the same cell if and only if b = o"(a) for some n € Z. We 
establish this partition using an appropriate equivalence relation: 


Fora,b¢ A, leta ~ bif and only if b = o"(a) for somen € Z. (1) 
We now check that ~ defined by Condition (1) is indeed an equivalence relation. 


Reflexive Clearly a ~ a since a = la) = o (a). 
Symmetric Ifa~ ), then b=o"(a) for some n € Z. But then a =o "(b) 
and—-n € Z,sob~ a. 


Transitive Supposea ~ bandb ~ c,thenb = o”(a)andc = o”(b) forsome 
n,m € Z. Substituting, we find that c = o”(o"(a)) = o"*™(a), 
soa~c. 


9.1 Definition Let o be a permutation of a set A. The equivalence classes in A determined by the 
equivalence relation (1) are the orbits of oc. | 


9.2 Example Since the identity permutation : of A leaves each element of A fixed, the orbits of « are 
the one-element subsets of A. A 


9.3 Example Find the orbits of the permutation 


in Sg. 
Solution To find the orbit containing 1, we apply o repeatedly, obtaining symbolically 


133536315346431435.... 
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9.5 Figure 


Since o~! would simply reverse the directions of the arrows in this chain, we see that 
the orbit containing 1 is {1, 3, 6}. We now choose an integer from 1 to 8 notin {1, 3, 6}, 
say 2, and similarly find that the orbit containing 2 is {2, 8}. Finally, we find that the 
orbit containing 4 is {4, 7, 5}. Since these three orbits include all integers from 1 to 8, 
we see that the complete list of orbits of o is 


{1,3,6}, {2,8}, {4,5, 7}. A 


Cycles 


For the remainder of this section, we consider just permutations of a finite set A of n 
elements. We may as well suppose that A = {1, 2, 3, ---, } and that we are dealing with 
elements of the symmetric group S,. 

Refer back to Example 9.3. The orbits of 


123 45 67 8 
Cree, 2) 
are indicated graphically in Fig. 9.4. That is, o acts on each integer from 1 to 8 on 
one of the circles by carrying it into the next integer on the circle traveled counter- 
clockwise, in the direction of the arrows. For example, the leftmost circle indicates that 


a(1) =3,0(3) = 6, and o(6) = 1. Figure 9.4 is a nice way to visualize the structure of 
the permutation o. 


8 


9.4 Figure 


Each individual circle in Fig. 9.4 also defines, by itself, a permutation in Sg. For 
example, the leftmost circle corresponds to the permutation 


offh, 2. 34 38 OF 8 

Hee ey a @) 
that acts on 1, 3, and 6 just as o does, but leaves the remaining integers 2, 4, 5, 7, and 8 
fixed. In summary, jz has one three-element orbit {1, 3, 6} and five one-element orbits 
{2}, {4}, {5}, {7}, and {8}. Such a permutation, described graphically by a single circle, 
is called a cycle (for circle). We consider the identity permutation to be a cycle since it 


can be represented by a circle having only the integer 1, as shown in Fig. 9.5. We now 
define the term cycle in a mathematically precise way. 


9.6 Definition 


9.7 Example 


9.8 Theorem 
Proof 
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A permutation o € S, is a cycle if it has at most one orbit containing more than one 
element. The length of a cycle is the number of elements in its largest orbit. | 


To avoid the cumbersome notation, as in Eq. (3), for a cycle, we introduce a single-row 
cyclic notation. In cyclic notation, the cycle in Eq. (3) becomes 


nw = (1, 3,6). 


We understand by this notation that yz carries the first number | into the second number 3, 
the second number 3 into the next number 6, etc., until finally the last number 6 is carried 
into the first number 1. An integer not appearing in this notation for jz is understood to 
be left fixed by yz. Of course, the set on which yz acts, which is {1, 2, 3, 4, 5, 6, 7, 8} in 
our example, must be made clear by the context. 


Working within Ss, we see that 


123 4 5 
03.5.4=(5 > 5 | AT 
Observe that 
0,3,5,)=6,5,4,) = (5,4, 1,3) = (4, 1,3, 5). A 


Of course, since cycles are special types of permutations, they can be multiplied just 
as any two permutations. The product of two cycles need not again be a cycle, however. 

Using cyclic notation, we see that the permutation o in Eq. (2) can be written as a 
product of cycles: 


123 45 67 8 
= = 2 
( oe ee (1, 3, (2, 8)(4, 7, 5). (4) 


These cycles are disjoint, meaning that any integer is moved by at most one of these 
cycles; thus no one number appears in the notations of two different cycles. Equation (4) 
exhibits o in terms of its orbits, and is a one-line description of Fig. 9.4. Every permu- 
tation in S, can be expressed in a similar fashion as a product of the disjoint cycles 
corresponding to its orbits. We state this as a theorem and write out the proof. 


Every permutation o of a finite set is a product of disjoint cycles. 


Let Bi, Bo,---, B, be the orbits of o, and let yz; be the cycle defined by 


_ Jo) for x € B; 

ee ae {° otherwise. 
Clearly o = [41 f42+++[4,. Since the equivalence-class orbits B), B2,---, B, being dis- 
tinct equivalence classes, are disjoint, the cycles 441, 42,---, &, are disjoint also. Sd 


While permutation multiplication in general is not commutative, it is readily seen 
that multiplication of disjoint cycles is commutative. Since the orbits of a permutation 
are unique, the representation of a permutation as a product of disjoint cycles, none of 
which is the identity permutation, is unique up to the order of the factors. 
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9.9 Example 


9.10 Example 


9.11 Definition 


9.12 Corollary 


Permutations, Cosets, and Direct Products 


Consider the permutation 


Let us write it as a product of disjoint cycles. First, 1 is moved to 6 and then 6 to 1, giving 
the cycle (1, 6). Then 2 is moved to 5, which is moved to 3, which is moved to 2, or 
(2, 5, 3). This takes care of all elements but 4, which is left fixed. Thus 


ew re 


ee a) 1) = 0,60, 5,3). 


Multiplication of disjoint cycles is commutative, so the order of the factors (1, 6) and 
(2, 5, 3) is not important. A 


You should practice multiplying permutations in cyclic notation where the cycles 
may or may not be disjoint. We give an example and provide further practice in Exercises 
7 through 9. 


Consider the cycles (1,4,5,6) and (2,1,5) in Ss. Multiplying, we find that 


123 4 5 6 
(1.4,5,602,1.5) = (§ 43 52 ') 


and 
123 4 5 6 
(2,1,5)1,4,5,6) = ¢ 1326 a) 
Neither of these permutations is a cycle. A 
Even and Odd Permutations 
It seems reasonable that every reordering of the sequence 1, 2,..., can be achieved 


by repeated interchange of positions of pairs of numbers. We discuss this a bit more 
formally. 


A cycle of length 2 is a transposition. | 


Thus a transposition leaves all elements but two fixed, and maps each of these onto 
the other. A computation shows that 


(dy, 42, +++, An) = (G1, An G1, Gn—1) ++ * (Ai, 43 )(a1, a2). 


Therefore any cycle is a product of transpositions. We then have the following as a 
corollary to Theorem 9.8. 


Any permutation of a finite set of at least two elements is a product of transpositions. 


Naively, this corollary just states that any rearrangement of n objects can be achieved 
by successively interchanging pairs of them. 


9.13 Example 


9.14 Example 


9.15 Theorem 


Proof 1 (From 
linear algebra) 


Proof 2 
(Counting orbits) 
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Following the remarks prior to the corollary, we see that (1, 6) (2, 5, 3) is the product 
(1, 6) (2, 3) (2, 5) of transpositions. A 


In S, for m > 2, the identity permutation is the product (1, 2) (1, 2) of transpositions. 
A 


We have seen that every permutation of a finite set with at least two elements is a 
product of transpositions. The transpositions may not be disjoint, and a representation 
of the permutation in this way is not unique. For example, we can always insert at the 
beginning the transposition (1, 2) twice, because (1, 2) (1, 2) is the identity permutation. 
What is true is that the number of transpositions used to represent a given permutation 
must either always be even or always be odd. This is an important fact. We will give 
two proofs. The first uses a property of determinants from linear algebra. The second 
involves counting orbits and was suggested by David M. Bloom. 


No permutation in 5S, can be expressed both as a product of an even number of transpo- 
sitions and as a product of an odd number of transpositions. 


We remarked in Section 8 that S4 ~ Sg if A and B have the same cardinality. We 
work with permutations of the n rows of the n x n identity matrix J,, rather than of the 
numbers 1, 2,..., 2. The identity matrix has determinant 1. Interchanging any two rows 
of a square matrix changes the sign of the determinant. Let C be a matrix obtained by a 
permutation o of the rows of [,. If C could be obtained from J, by both an even number 
and an odd number of transpositions of rows, its determinant would have to be both 1 
and —1, which is impossible. Thus o cannot be expressed both as a product of an even 
number and an odd number of transpositions. 


Leto € S, and let t = (i, /) be a transposition in S,,. We claim that the number of orbits 
of o and of to differ by 1. 


CaseI Suppose i and j are in different orbits of 0. Write o as a product of 
disjoint cycles, the first of which contains j and the second of which 
contains i, symbolized by the two circles in Fig. 9.16. We may write the 
product of these two cycles symbolically as 


(b,j, X, X, XG, i, x, x) 


where the symbols x denote possible other elements in these orbits. 


9.16 Figure 
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9.17 Figure 


9.18 Definition 


9.19 Example 


Permutations, Cosets, and Direct Products 


Computing the product of the first three cycles in to = (i, j)o, we 
obtain 


(i, Jb, j, X, X, xa, i, X, x) = (a, j, X, X, XB, i, X, X). 


The original 2 orbits have been joined to form just one in To as 
symbolized in Fig. 9.16. Exercise 28 asks us to repeat the computation 
to show that the same thing happens if either one or both of i and j 
should be only element of their orbit inc. 


Case II Suppose i and j are in the same orbit of o. We can then write o asa 
product of disjoint cycles with the first cycle of the form 


(a, i, Ky, x, b,j, x, x) 


shown symbolically by the circle in Fig. 9.17. Computing the product of 
the first two cycles in to = (i, j)o, we obtain 


Gi, j,i, XX, x, 6. j,X, x)= Gj. x, x)(b, i, X, X, X). 


The original single orbit has been split into two as symbolized in 
Fig. 9.17. 


We have shown that the number of orbits of to differs from the number of 
orbits of o by 1. The identity permutation : has n orbits, because each element is 
the only member of its orbit. Now the number of orbits of a given permutation 
o € S§, differs from n by either an even or an odd number, but not both. Thus it is 
impossible to write 


GF = T7273 °°* Tb 


where the t; are transpositions in two ways, once with m even and once with m 
odd. 5 


A permutation of a finite set is even or odd according to whether it can be expressed 
as a product of an even number of transpositions or the product of an odd number of 
transpositions, respectively. | 


The identity permutation : in S, is an even permutation since we have « = (1, 2)(1, 2). 
If n = 1 so that we cannot form this product, we define : to be even. On the other hand, 
the permutation (1, 4, 5, 6) (2, 1, 5) in 5 can be written as 


(1,4, 5,6)(2,1,5 = (0, 6, 5d, H2, NE, D 


which has five transpositions, so this is an odd permutation. A 


The Alternating Groups 


We claim that for n > 2, the number of even permutations in S,, is the same as the number 
of odd permutation; that is, S,, is split equally and both numbers are (n!)/2. To show this, 
let A, be the set of even permutations in S, and let B, be the set of odd permutations 
for n > 2. We proceed to define a one-to-one function from A, onto By. This is exactly 
what is needed to show that A, and B, have the same number of elements. 


9.20 Theorem 


9.21 Definition 
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Let t be any fixed transposition in S,,; it exists since n > 2. We may as well suppose 
that 7 = (1, 2). We define a function 


Ar: An = Ba 
by 
A,(a) = to, 


thatis,o € A, ismapped into (1, 2)o by A,. Observe that since o is even, the permutation 
(1, 2)o can be expressed as a product of a (1 + even number), or odd number, of 
transpositions, so (1, 2)o is indeed in B,,. Ifforo and win A, itis true thatA,(o) = A; (fu), 
then 


(1, jo = (1, 2)y, 
and since S, is a group, we have o = yw. Thus 1, is a one-to-one function. Finally, 
2S) 
soif p € B,, then 
tp € An, 
and 
Le py =e "py =p. 


Thus i, is onto B,. Hence the number of elements in A, is the same as the number in 
B,, since there is a one-to-one correspondence between the elements of the sets. 

Note that the product of two even permutations is again even. Also since n > 2, S, 
has the transposition (1, 2) ands = (1, 2)(1, 2) is an even permutation. Finally, note that 
if o is expressed as a product of transpositions, the product of the same transpositions 
taken in just the opposite order is o~'. Thus if o is an even permutation, 0~* must also 
be even. Referring to Theorem 5.14, we see that we have proved the following statement. 


If n > 2, then the collection of all even permutations of {1, 2, 3, ---,} forms a subgroup 
of order n!/2 of the symmetric group S,,. 


The subgroup of S,, consisting of the even permutations of n letters is the alternating 
group A, on v letters. i] 


Both S, and A, are very important groups. Cayley’s theorem shows that every finite 
group G is structurally identical to some subgroup of S, for n = |G|. It can be shown 
that there are no formulas involving just radicals for solution of polynomial equations 
of degree n for n > 5. This fact is actually due to the structure of A, surprising as that 
may seem! 
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@ EXERCISES 9 


Computations 


In Exercises 1 through 6, find all orbits of the given permutation. 


1 123 4 5 6 2 12345 67 8 
“(5 13 62 4 "“\5 62 48 3 1 7 
123 45 67 8 
. : h — 
3. ( 35 1468 *) 4.¢:Z— Zwhereo(n)=n-+1 
5.0: 2— Zwhere o(n) =n +2 6. 0: Z— Z where o(n) =n — 3 


In Exercises 7 through 9, compute the indicated product of cycles that are permutations of {1, 2, 3, 4, 5, 6, 7, 8h. 
7. 1,4, 57, 8)(2, 5,7) 8. (1, 3, 2, 7)(4, 8, 6) 
9. (1, 2)(4, 7, 8), 1)C7, 2, 8, 1,5) 


In Exercises 10 through 12, express the permutation of {1, 2, 3, 4, 5, 6, 7, 8} as a product of disjoint cycles, and 
then as a product of transpositions. 


the Be SOE CO) 
NS 2G BAF ALS 1 SG Ae Te eT 


Gis 2 2 2 ee Me 
"(301472 5 8 6 


13. Recall that element a of a group G with identity element e has order r > Oif a” = e and no smaller positive 
power of a is the identity. Consider the group Sg. 


a. What is the order of the cycle (1, 4, 5, 7)? 

. State a theorem suggested by part (a). 

| What is the order of o = (4, 5)(2, 3, 7)? of r = (1, 4)03, 5, 7, 8)? 

. Find the order of each of the permutations given in Exercises 10 through 12 by looking at its decomposition 
into a product of disjoint cycles. 


e. State a theorem suggested by parts (c) and (d). [Hint: The important words you are looking for are least 
common multiple.) 


ao 


In Exercises 14 through 18, find the maximum possible order for an element of S, for the given value of n. 


14.n=5 15.n=6 16.27 =7 17. 1 =10 18. n =15 


19. Figure 9.22 shows a Cayley digraph for the alternating group A, using the generating set S = {(1, 2, 3), 
(1, 2), 4}. Continue labeling the other nine vertices with the elements of Ay, expressed as a product of 
disjoint cycles. 

Concepts 


In Exercises 20 through 22, correct the definition of the italicized term without reference to the text, if correction 
is needed, so that it is in a form acceptable for publication. 


20. For a permutation o of a set A, an orbit of o is a nonempty minimal subset of A that is mapped onto itself by o. 
21. A cycle is a permutation having only one orbit. 


22. The alternating group is the group of all even permutations. 
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(1, 2)(3, 4) 


9.22 Figure 


23. Mark each of the following true or false. 


a. Every permutation is a cycle. 


b. Every cycle is a permutation. 


c. The definition of even and odd permutations could have been given equally well before Theo- 
rem 9.15, 


d. Every nontrivial subgroup 7 of Sy containing some odd permutation contains a transposition. 
e. As has 120 elements. 

f. S, is not cyclic for any z > 1. 

8 

h 


. Az is acommutative group. 


. 57 is isomorphic to the subgroup of all those elements of Sg that leave the number 8 fixed. 
i. Sz is isomorphic to the subgroup of all those elements of Sg that leave the number 5 fixed. 
j- The odd permutations in Sg form a subgroup of Sx. 


24. Which of the permutations in $3; of Example 8.7 are even permutations? Give the table for the alternating group 
Ay. 


Proof Synopsis 
25. Give a one-sentence synopsis of Proof 1 of Theorem 9.15. 


26. Give a two-sentence synopsis of Proof 2 of Theorem 9.15. 


Theory 
27. Prove the following about S,, ifn > 3. 
a. Every permutation in S, can be written as a product of at most n — 1 transpositions. 


b. Every permutation in §, that is not a cycle can be written as a product of at most n — 2 transpositions. 


c. Every odd permutation in S,, can be written as a product of 2n + 3 transpositions, and every even permutation 
as a product of 2n + 8 transpositions. 
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28. 


29. 


30. 


31. 


32. 


33. 


34. 
35. 


36. 


37. 
38. 


39. 
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a. Draw a figure like Fig. 9.16 to illustrate that if i and j are in different orbits of o and o(7) =i, then the 
number of orbits of (i, j)o is one less than the number of orbits of o. 


b. Repeat part (a) if o(j) = j also. 
Show that for every subgroup H of S,, for n > 2, either all the permutations in H are even or exactly half of 
them are even. 
Let o be a permutation of a set A. We shall say “o moves a € A” if o(a) # a. If A is a finite set, how many 
elements are moved by acycle o € Sy of length n? 
Let A be an infinite set. Let H be the set of all o € S, such that the number of elements moved by o (see 
Exercise 30) is finite. Show that H is a subgroup of S,,. 
Let A be an infinite set. Let K be the set of all o € S,4 that move (see Exercise 30) at most 50 elements of A. 
Is K asubgroup of S4? Why? 
Consider S,, for a fixed n > 2 and let o be a fixed odd permutation. Show that every odd permutation in S,, is 
a product of o and some permutation in Ap. 
Show that if o is a cycle of odd length, then o? is a cycle. 
Following the line of thought opened by Exercise 34, complete the following with a condition involving n and 
r so that the resulting statement is a theorem: 

If o is acycle of length n, then o” is also a cycle if and only if... 
Let G be a group and let a be a fixed element of G. Show that the map 4, : G > G, given by Aq(g) = ag for 
g € G, is a permutation of the set G. 
Referring to Exercise 36, show that H = {Aq |a € G} is a subgroup of Sg, the group of all permutations of G. 
Referring to Exercise 49 of Section 8, show that H of Exercise 37 is transitive on the set G. [Hint: This is an 
immediate corollary of one of the theorems in Section 4.] 
Show that S, is generated by {(1, 2), (1, 2,3,---,#)}. [Hint: Show that as r varies, (1, 2,3,---,”)'CL, 2) 
(1,2, 3,--+,n)"” gives all the transpositions (1, 2), (2, 3), G, 4), ++, @— 1,1), (2, D. Then show that any 
transposition is a product of some of these transpositions and use Corollary 9.12] 


COSETS AND THE THEOREM OF LAGRANGE 


You may have noticed that the order of a subgroup H ofa finite group G seems always 
to be a divisor of the order of G. This is the theorem of Lagrange. We shall prove it by 
exhibiting a partition of G into cells, all having the same size as H. Thus if there are r 
such cells, we will have 


r(order of H) = (order of G) 


from which the theorem follows immediately. The cells in the partition will be called 
cosets of H, and they are important in their own right. In Section 14, we will see that if 
H satisfies a certain property, then each coset can be regarded as an element of a group 
in a very natural way. We give some indication of such coset groups in this section to 
help you develop a feel for the topic. 


Cosets 


Let H be a subgroup of a group G, which may be of finite or infinite order. We exhibit 
two partitions of G by defining two equivalence relations, ~, and ~z on G. 


10.1 Theorem 


Proof 


10.2 Definition 


10.3 Example 


Solution 
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Let H be a subgroup of G. Let the relation ~; be defined on G by 
a~zyb if and only if abe H. 
Let ~p be defined by 
a~rb  ifandonlyif ab!'eH. 
Then ~; and ~ p are both equivalence relations on G. 


We show that ~ is an equivalence relation, and leave the proof for ~z to Exercise 26. 
When reading the proof, notice how we must constantly make use of the fact that H is 
a subgroup of G. 


Reflexive Leta € G.Thena ‘a = eande € H since H is asubgroup. Thus 
a~r, a. 

Symmetric Supposea ~, b. Thena~!b € H. Since H isasubgroup, (a~'b)7! 
is in H and (a-!b)“! = ba, so ba isin H andb ~, a. 

Transitive Leta ~, bandb ~; c. Thena'b € H andb-'c € H. Since H 
is a subgroup, (a—'b\(b-'c) = ac isin H,soa~z c. Oy 


The equivalence relation ~z in Theorem 10.1 defines a partition of G, as described 
in Theorem 0.22. Let’s see what the cells in this partition look like. Suppose a € G. The 
cell containing a consists of allx € Gsuchthata ~, x, which means all x € G such that 
a7'x € H.Nowa7'x € H if and only ifa~'x =A for some h € H, or equivalently, if 
andonly ifx = ah forsomeh € H. Therefore thecell containing ais {ah |h € H}, which 
we denote by aH. If we go through the same reasoning for the equivalence relation ~z 
defined by H, we find the cell in this partition containing a € Gis Ha = {ha|h € H}. 
Since G need not be abelian, we have no reason to expect aH and Ha to be the same 
subset of G. We give a formal definition. 


Let H be a subgroup of a group G. The subset aH = {ah |h € H} of G is the left 
coset of H containing a, while the subset Ha = {ha|h € H} is the right coset of 7 
containing a. | 


Exhibit the left cosets and the right cosets of the subgroup 3Z of Z. 
Our notation here is additive, so the left coset of 3Z containing m is m + 3Z. Taking 
m = 0, we see that 

SE = {heey 90-3, 0, 3).0,9,4°| 


is itself one of its left cosets, the coset containing 0. To find another left coset, we select 
an element of Z not in 3Z, say 1, and find the left coset containing it. We have 


eee ees ee eer ea Oe ee eel eer 


These two left cosets, 3Z and 1 + 3Z, do not yet exhaust Z. For example, 2 is in neither 
of them. The left coset containing 2 is 


2+3Z={---,—-7, 4, -1,2,5, 8, 11,---}. 
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10.4 Example 


Solution 
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It is clear that these three left cosets we have found do exhaust Z, so they constitute the 
partition of Z into left cosets of 3Z. 

Since Z is abelian, the left coset m + 3Z and the right coset 3Z + m are the same, 
so the partition of Z into right cosets is the same. A 


We observe two things from Example 10.3. 


For a subgroup H of an abelian group G, the partition of G into left cosets of H 
and the partition into right cosets are the same. 


Also, looking back at Examples 0.17 and 0.20, we sec that the equivalence relation ~p 
for the subgroup nZ of Z is the same as the relation of congruence modulo n. Recall that 
h =k (modn) in Zif h — k is divisible by n. This is the same as saying that h + (—k) is 
in nZ, which is relation ~g of Theorem 10.1 in additive notation. Thus the partition of 
Z into cosets of nZ is the partition of Z into residue classes modulo n. For that reason, 
we often refer to the cells of this partition as cosets modulo nZ. Note that we do not have 
to specify left or right cosets since they are the same for this abelian group Z. 


The group Ze is abelian. Find the partition of Zs into cosets of the subgroup H = {0, 3}. 


One coset is {0, 3} itself. The coset containing | is 1 + {0,3} = {1, 4}. The coset con- 
taining 2 is 2+ {0, 3} = {2, 5}. Since {0, 3}, {1,4}, and {2, 5} exhaust all of Z,, these 
are all the cosets. A 


We point out a fascinating thing that we will develop in detail in Section 14. Referring 
back to Example 10.4, Table 10.5 gives the binary operation for Zs but with elements 
listed in the order they appear in the cosets {0, 3}, {1, 4}, (2, 5}. We shaded the table 
according to these cosets. 

Suppose we denote these cosets by LT(light), MD(medium), and DK(dark) accord- 
ing to their shading. Table 10.5 then defines a binary operation on these shadings, as 
shown in Table 10.6. Note that if we replace LT by 0, MD by 1, and DK by 2 in Table 10.6, 
we obtain the table for Z3. Thus the table of shadings forms a group! We will see in 


10.5 Table 10.6 Table 
+ |0)3)4 454 ] 

0 [0/3 |iee. 

3 13/014]1 

{ [ti4 310 
Bee Ae 


10.7 Example 


Solution 
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Section 14 that for a partition of an abelian group into cosets of a subgroup, reordering 
the group table according to the elements in the cosets always gives rise to such a coset 


group. 


Table 10.8 again shows Table 8.8 for the symmetric group $3 on three letters. Let H be 
the subgroup (14,) = {/, 41} of $3. Find the partitions of S3 into left cosets of H, and 
the partition into right cosets of H. 


For the partition into left cosets, we have 


H = {po, 1}, 
MH = {01 Po, Pi} = {P1, 3}, 
p2H = {p200, e2!1} = (02, fa}. 


The partition into right cosets is 


A = {£0, [41}, 
Hp, = {po1, Hiei} = (01, #2}; 
Hex = (0002, 4102} = (02, M3}. 


The partition into left cosets of H is different from the partition into right cosets. For 
example, the left coset containing p; is {e), 43}, while the right coset containing /, is 
{01, 42}. This does not surprise us since the group 53 is not abelian. A 


Referring to Example 10.7, Table 10.9 gives permutation multiplication in $3. The 
elements are listed in the order they appear in the left cosets {90, 41}, {o1, U3}, (02, He} 
found in that example. Again, we have shaded the table light, medium, and dark according 
to the coset to which the element belongs. Note the difference between this table and 
Table 10.5. This time, the body of the table does not split up into 2 x 2 blocks opposite 
and under the shaded cosets at the left and the top, as in Table 10.5 and we don’t get a 
coset group. The product of a light element and a dark one may be either dark or medium. 

Table 10.8 is shaded according to the two left cosets of the subgroup (1) = 
{P0, 21; P2} of S3. These are also the two right cosets, even though $3 is not abelian. 


10.8 Table 10.9 Table 


Po | M1 bp 1B 


Po | Po | 1 b Pi Bs 


Hi | Hi Po 


Proje Pry fs Po | Li 


Ha) Ms) Pr 4 ta | po 


Po | 41 |. Rr fe 


aa | pr | bs | po 
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10.10 Theorem 


Proof 


10.11 Corollary 


Proof 
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From Table 10.8 it is clear that we do have a coset group isomorphic to Zz in this case. 
We will see in Section 14 that the left cosets of a subgroup H of a group G give rise to 
a coset group precisely when the partition of G into left cosets of H is the same as the 
partition into right cosets of H. In such a case, we may simply speak of the cosets of A, 
omitting the adjective left or right. We discuss coset groups in detail in Section 14, but 
we think it will be easier for you to understand them then if you experiment a bit with 
them now. Some of the exercises in this section are designed for such experimentation. 


The Theorem of Lagrange 


Let H be a subgroup of a group G. We claim that every left coset and every right coset 
of H have the same number of elements as H. We show this by exhibiting a one-to-one 
map of H onto aleft coset gH of H for a fixed element g of G. If H is of finite order, this 
will show that gH has the same number of elements as H. If H is infinite, the existence 
of such a map is taken as the definition for equality of the size of H and the size of gH. 
(See Definition 0.13.) 

Our choice for a one-to-one map ¢ : H — gH is the natural one. Let @(h) = gh 
for each h € H. This map is onto gH by the definition of gH as {gh|h € H}. To show 
that it is one to one, suppose that d(h,) = $(2) for hy and A» in H. Then gh, = gho 
and by the cancellation law in the group G, we have h; = ho. Thus ¢ is one to one. 

Of course, a similar one-to-one map of H onto the right coset H g can be constructed. 
(See Exercise 27.) We summarize as follows: 


Every coset (left or right) of a subgroup H of a group G has the same number of 
elements as H. 


We can now prove the theorem of Lagrange. 


(Theorem of Lagrange) Let H be a subgroup of a finite group G. Then the order of 
H is a divisor of the order of G. 


Let n be the order of G, and let H have order m. The preceding boxed statement shows 
that every coset of H also has m elements. Let r be the number of cells in the partition 
of G into left cosets of H. Then n = rm, so m is indeed a divisor of n. ° 


Note that this elegant and important theorem comes from the simple counting of 
cosets and the number of elements in each coset. Never underestimate results that count 
something! We continue to derive consequences of Theorem 10.10, which should be 
regarded as a counting theorem. 


Every group of prime order is cyclic. 


Let G be of prime order p, and let a be an clement of G different from the identity. Then 
the cyclic subgroup (a) of G generated by a has at least two elements, a and e. But by 


10.12 Theorem 


Proof 


10.13 Definition 


10.14 Theorem 
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Theorem 10.10, the order m > 2 of (a) must divide the prime p. Thus we must have 
m = p and (a) = G, so G is cyclic. 5 


Since every cyclic group of order p is isomorphic to Z,, we see that there is only 
one group structure, up to isomorphism, of a given prime order p. Now doesn’t this 
elegant result follow easily from the theorem of Lagrange, a counting theorem? Never 
underestimate a theorem that counts something. Proving the preceding corollary is a 
favorite examination question. 


The order of an element of a finite group divides the order of the group. 


Remembering that the order of an element is the same as the order of the cyclic subgroup 
generated by the element, we see that this theorem follows directly from Theorem 10.10. 
Sa 


Let H be a subgroup of a group G. The number of left cosets of H in G is the index 
(G:H)of HinG. | 


The index (G : H) just defined may be finite or infinite. If G is finite, then obviously 
(G : H) is finite and (G: H) = |G|/|H|, since every coset of H contains | H| elements. 
Exercise 35 shows the index (G : H) could be equally well defined as the number of right 
cosets of H in G. We state a basic theorem concerning indices of subgroups, and leave 
the proof to the exercises (see Exercise 38). 


Suppose H and K are subgroups of a group G such that K < H < G, and suppose 
(H : K) and (G: H) are both finite. Then (G: K)is finite, and(G: K) = (G: H)(H: K). 


Theorem 10.10 shows that if there is a subgroup H of a finite group G, then the 
order of H divides the order of G. Is the converse true? That is, if G is a group of order 
n, and m divides n, is there always a subgroup of order m? We will see in the next section 
that this is true for abelian groups. However, Aa can be shown to have no subgroup of 
order 6, which gives a counterexample for nonabelian groups. 
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Computations 


1. Find all cosets of the subgroup 4Z of Z. 
2. Find all cosets of the subgroup 4Z of 2Z. 


SnD wm BB & 


. Find all cosets of the subgroup (2) of Zj2. 

. Find all cosets of the subgroup (4) of Zia. 

. Find all cosets of the subgroup (18) of Z36. 

. Find all left cosets of the subgroup {, {42} of the group D4 given by Table 8.12. 

. Repeat the preceding exercise, but find the right cosets this time. Are they the same as the left cosets? 
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. Rewrite Table 8.12 in the order exhibited by the left cosets in Exercise 6. Do you seem to get a coset group of 


order 4? If so, is it isomorphic to Z, or to the Klein 4-group V? 


. Repeat Exercise 6 for the subgroup {/9, 62} of D4. 
. Repeat the preceding exercise, but find the right cosets this time. Are they the same as the left coset? 
. Rewrite Table 8.12 in the order exhibited by the left cosets in Exercise 9. Do you seem to get a coset group of 


order 4? If so, is it isomorphic to Z, or to the Klein 4-group V? 


Find the index of (3) in the group Zo4. 


. Find the index of (1) in the group 53, using the notation of Example 10.7 
. Find the index of (42) in the group D, given in Table 8.12 
. Leto = (1, 2,5, 4)(2, 3) in Ss. Find the index of (o) in Ss. 
16. 


Let u = (1, 2, 4, 5)(3, 6) in Ss. Find the index of (12) in Se. 


Concepts 


In Exercises 17 and 18, correct the definition of the italicized term without reference to the text, if correction is 
needed, so that it is in a form acceptable for publication. 


17. 
18. 
19. 


Let G be a group and let H © G. The left coset of H containing a is aH = fah|h € #}. 
Let G be a group and let H < G. The index of H in G is the number of right cosets of H in G. 
Mark each of the following true or false. 


a. Every subgroup of every group has left cosets. 
b. The number of left cosets of a subgroup of a finite group divides the order of the group. 


c. Every group of prime order is abelian. 

d. One cannot have left cosets of a finite subgroup of an infinite group. 
e. A subgroup of a group is a left coset of itself. 

f. Only subgroups of finite groups can have left cosets. 

g. 

h. 


A, is of index 2 in S, forn > 1. 
The theorem of Lagrange is a nice result. 
i. Every finite group contains an element of every order that divides the order of the group. 
j. Every finite cyclic group contains an element of every order that divides the order of the group. 


In Exercises 20 through 24, give an example of the desired subgroup and group if possible. If impossible, say why 
it is impossible. 


20. 
21. 
22. 
23. 
24. 


A subgroup of an abelian group G whose left cosets and right cosets give different partitions of G 
A subgroup of a group G whose left cosets give a partition of G into just one cell 

A subgroup of a group of order 6 whose left cosets give a partition of the group into 6 cells 

A subgroup of a group of order 6 whose left cosets give a partition of the group into 12 cells 


A subgroup of a group of order 6 whose left cosets give a partition of the group into 4 cells 


Proof Synopsis 


25. 


Give a one-sentence synopsis of the proof of Theorem 10.10. 


Theory 


26. 
27. 


Prove that the relation ~p of Theorem 10.1 is an equivalence relation. 


Let H be a subgroup of a group G and let g € G. Define a one-to-one map of H onto Hg. Prove that your map 
is one to one and is onto Hg. 


28. 


29, 
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Let H be a subgroup of a group G such that g~!hg € H forall g ¢ Gandallh € H. Show that every left coset 
gH is the same as the right coset Hg. 


Let H be a subgroup of a group G. Prove that if the partition of G into left cosets of H is the same as the 
partition into right cosets of H, then g~'hg € H forall g € Gand allh € H. (Note that this is the converse of 
Exercise 28.) 


Let H be a subgroup of a group G and let a, b € G. In Exercises 30 through 33 prove the statement or give a 
counterexample. 


30. 
31. 
32. 
33. 
34. 


35. 


36. 


37. 


38. 


39. 
40. 
41. 


42. 


43. 


44. 


IfaH = bH, then Ha = Ab. 

If Ha = Ab, thenb € Ha. 

If aH = bH, then Ha! = Hb7}. 

If aH = bH, thena*H =b"H. 

Let G be a group of order pq, where p and q are prime numbers. Show that every proper subgroup of G is 
cyclic. 


Show that there are the same number of left as right cosets of a subgroup H of a group G; that is, exhibit 
a one-to-one map of the collection of left cosets onto the collection of right cosets. (Note that this result is 
obvious by counting for finite groups. Your proof must hold for any group.) 


Exercise 29 of Section 4 showed that every finite group of even order 2n contains an element of order 2. Using 
the theorem of Lagrange, show that ifn is odd, then an abelian group of order 2n contains precisely one element 
of order 2. 


Show that a group with at least two elements but with no proper nontrivial subgroups must be finite and of 
prime order. 


Prove Theorem 10.14 [Hint: Let {a;H |i = 1, ---,r} be the collection of distinct left cosets of H in G and 
{bj K | j =1,---,s} be the collection of distinct left cosets of K in H. Show that 

{(a;bj))K |i = leery = 1,-++, 5s} 
is the collection of distinct left cosets of K in G.] 
Show that if H is a subgroup of index 2 ina finite group G, then every left coset of H is also aright coset of H. 
Show that if a group G with identity ¢ has finite order n, then a” = e foralla € G. 


Show that every left coset of the subgroup Z of the additive group of real numbers contains exactly one element 
x such thatO <x <1. 


Show that the function sine assigns the same value to each element of any fixed left coset of the subgroup (277) 
of the additive group R of real numbers. (Thus sine induces a well-defined function on the set of cosets; the 
value of the function on a coset is obtained when we choose an element x of the coset and compute sin x.) 


Let H and K be subgroups of a group G. Define ~ on G by a ~ b if and only if a = hbk for some h € H and 

some k € K. 

a. Prove that ~ is an equivalence relation on G. 

b. Describe the elements in the equivalence class containing a € G. (These equivalence classes are called 
double cosets.) 

Let $4 be the group of all permutations of the set A, and let c be one particular element of A. 


a. Show that {o € S4|o(c) =c} is a subgroup S,... of S4. 

b. Let d # c be another particular element of A. Is S,.¢ = {o € S4|o(c) = d} a subgroup of $4? Why or why 
not? 

c. Characterize the set S,.¢ of part (b) in terms of the subgroup S,., of part (a). 
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45. Show that a finite cyclic group of order n has exactly one subgroup of each order d dividing n, and that these 
are all the subgroups it has. 


46. 


47. 


The Euler phi-function is defined for positive integers n by y(n) = s, where s is the number of positive integers 


less than or equal to n that are relatively prime to n. Use Exercise 45 to show that 


n=) ¢(d), 


d|n 


the sum being taken over all positive integers d dividing n. [Hint: Note that the number of generators of Zg is 
g(d) by Corollary 6.16.] 


Let G be a finite group. Show that if for each positive integer m the number of solutions x of the equation 


x” =e inG is at most m, then G is cyclic. [Hint: Use Theorem 10.12 and Exercise 46 to show that G must 
contain an element of order n = jG].] 


11.1 Definition 


DIRECT PRODUCTS AND FINITELY GENERATED ABELIAN GROUPS 
Direct Products 


Let us take a moment to review our present stockpile of groups. Starting with finite 
groups, we have the cyclic group Z,, the symmetric group S,, and the alternating group 
A, for each positive integer n. We also have the dihedral groups D, of Section 8, and the 
Klein 4-group V. Of course we know that subgroups of these groups exist. Turning to 
infinite groups, we have groups consisting of sets of numbers under the usual addition or 
multiplication, as, for example, Z, R, and C under addition, and their nonzero elements 
under multiplication. We have the group U of complex numbers of magnitude 1 under 
multiplication, which is isomorphic to each of the groups R, under addition modulo c, 
where c € R*. We also have the group Sj, of all permutations of an infinite set A, as 
well as various groups formed from matrices. 

One purpose of this section is to show a way to use known groups as building blocks 
to form more groups. The Klein 4-group will be recovered in this way from the cyclic 
groups. Employing this procedure with the cyclic groups gives us a large class of abelian 
groups that can be shown to include all possible structure types for a finite abelian group. 
We start by generalizing Definition 0.4. 


The Cartesian product of sets 5), S:,---,5S, is the set of all ordered n-tuples 
(a1, d2,++*,d,), where a; € S; fori = 1, 2,-+-,n. The Cartesian product is denoted 
by cither 


Sy xX $2. x +++ x Sy 


or by 


fs. : 
i=l 


We could also define the Cartesian product of an infinite number of sets, but the 
definition is considerably more sophisticated and we shall not need it. 

Now let G1, G2, ---, G, be groups, and let us use multiplicative notation for all 
the group operations. Regarding the G; as sets, we can form [];_, G;. Let us show that 
we can make [];_, G; into a group by means of a binary operation of multiplication by 


11.2 Theorem 


Proof 


11.3 Example 
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components. Note again that we are being sloppy when we use the same notation for a 
group as for the set of elements of the group. 


Let Gj, Gz, +++, Gy be groups. For (a, a2,--+,a,) and (b, bo, ---, by) in jes G;, 
define (a1, a2,+++,a,)(b1, bx, +++, by) to be the element (a,)1, dab2, +--+, d,b,). Then 
T]j-1 G; is a group, the direct product of the groups G;, under this binary operation. 


Note that since a; € G;, b; € G;, and G; is a group, we have a;b; € G;. Thus the defi- 
nition of the binary operation on []j_, G; given in the statement of the theorem makes 
sense; that is, [];_, G; is closed under the binary operation. 

The associative law in [];_, G; is thrown back onto the associative law in each 
component as follows: 


(a), 42, -++, Qn (by, b2, >>, Baer, €2, +> On) 
= (a, 42, +++, G1 C1, byc2, +++. Baca) 
= (a (b1¢1), a2(b2C2), +++, Gn(Onen)) 
= ((a,by Jey, (a2b2)e2, +++, (Anbn den) 
= (ay by, Arba, +++, AnPy (C1, C2, +++ Ca) 
= [(41, 42, +++, Gn)(B1, b2, +++, bn) Ce, €25 +++ 5 en): 


If e; is the identity element in G;, then clearly, with multiplication by components, 


(€], €2,+++,@,) 18 an identity in The: G;. Finally, an inverse of (a), a@2,--+, dy) is 
a ay 1 ...,a71); compute the product by components. Hence []'_, G; is a group. 
¢ 


In the event that the operation of each G; is commutative, we sometimes use additive 
notation in [];_, G; and refer to []}_, G; as the direct sum of the groups G;. The 
notation @"_, G; is sometimes used in this case in place of []_, Gi, especially with 
abelian groups with operation +. The direct sum of abelian groups G1, G2,---, Gy may 
be written Gj @ Gz @--- ® Gy. We leave to Exercise 46 the proof that a direct product 
of abelian groups is again abelian. 

It is quickly seen that if the S; has r; elements for i = 1,---,n, then []}_, S; has 
ryr2 +++, elements, for in an n-tuple, there are r; choices for the first component from 
S,, and for each of these there are rz choices for the next component from 5, and so on. 


Consider the group Z, x Z3, which has 2 - 3 = 6 elements, namely (0, 0), (0, 1), (0, 2), 
(1, 0), , 1), and (1, 2). We claim that Z, x Zs is cyclic. It is only necessary to find a 
generator. Let us try (1, 1). Here the operations in Z, and Z3 are written additively, so 
we do the same in the direct product Z, x Zs. 


gd,)b=d,)) 
20, 1) = (1, 1) +, 1) = (0, 2) 
30,0 =d,D04+d,04+d, 1) =d,9) 
40,0) =30.04+d,D=d,0)+ d, 1) =, 1) 
5,1) =40,)4+d,)=(0,D+d, 1) =d, 2) 
60,1) =50,D+d,1 =d,2)4+ d, 1D) = ©, 0) 
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Proof 


11.6 Corollary 


11.7 Example 
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Thus (1, 1) generates all of Z. x Z3. Since there is, up to isomorphism, only one cyclic 
group structure of a given order, we see that Z; x Z; is isomorphic to Ze. A 


Consider Z, x Z3. This is a group of nine elements. We claim that Z3 x Zs is not cyclic. 
Since the addition is by components, and since in Z; every element added to itself three 
times gives the identity, the same is true in Z; x Z3. Thus no element can generate the 
group, for a generator added to itself successively could only give the identity after nine 
summands. We have found another group structure of order 9. A similar argument shows 
that Zz x Zo is not cyclic. Thus Z; x Zz must be isomorphic to the Klein 4-group.  & 


The preceding examples illustrate the following theorem: 


The group Zm x Zn is cyclic and is isomorphic to Zm, if and only if m and n are relatively 
prime, that is, the gcd of m and nis 1. 


Consider the cyclic subgroup of Zm x Zn generated by (1, 1) as described by Theorem 
5.17. As our previous work has shown, the order of this cyclic subgroup is the smallest 
power of (1, 1) that gives the identity (0, 0). Here taking a power of (1, 1) in our additive 
notation will involve adding (1, 1) to itself repeatedly. Under addition by components, 
the first component 1 € Z,, yiclds 0 only after m summands, 2m summands, and so on, 
and the second component 1 € Z, yields 0 only after n summands, 2 summands, and 
so on. For them to yield 0 simultaneously, the number of summands must be a multiple 
of both m and n. The smallest number that is a multiple of both m and n will be mn if 
and only if the ged of m and n is 1; in this case, (1, 1) generates a cyclic subgroup of 
order mn, which is the order of the whole group. This shows that Z, x Z, is cyclic of 
order mn, and hence isomorphic to Z, if m and n are relatively prime. 

For the converse, suppose that the ged of m and n is d > 1. The mn/d is divisible 
by both m and n. Consequently, for any (7, 8) in Zm x Zn, we have 


(r,s) +, 5) +---+(%, 5) = (0, 9). 
mn/d summands 


Hence no element (r,s) in Z, X Z, can generate the entire group, so Zm X Zn is 
not cyclic and therefore not isomorphic to Zinn. ¢ 


This theorem can be extended to a product of more than two factors by similar 
arguments. We state this as a corollary without going through the details of the proof. 


The group | ]j_, Zm, is cyclic and isomorphic to Zmm,.-m, if and only if the numbers m; 
fori = 1,---, are such that the gcd of any two of them is 1. 


The preceding corollary shows that if n is written as a product of powers of distinct prime 
numbers, as in 


n= (p1)"' (po) +++ (pr), 
then Z, is isomorphic to 
Zep X Lpyya X +++ X Zep,yr 
In particular, Z72 is isomorphic to Zg x Zo. A 


11.8 Definition 


11.9 Theorem 


Proof 


11.10 Example 


Solution 


11.11 Example 
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We remark that changing the order of the factors in a direct product yields a group 
isomorphic to the original one. The names of elements have simply been changed via a 
permutation of the components in the m-tuples. 

Exercise 47 of Section 6 asked you to define the least common multiple of two 
positive integers r and s as a generator of a certain cyclic group. It is straightforward to 
prove that the subset of Z consisting of all integers that are multiples of both 7 and s is 
a subgroup of Z, and hence is a cyclic group. Likewise, the set of all common multiples 
of n positive integers r), r2, ---, Ff, is a subgroup of Z, and hence is cyclic. 


Let 71,72, --+,1, be positive integers. Their least common multiple (abbreviated lcm) 
is the positive generator of the cyclic group of all common multiples of the r;, that is, 
the cyclic group of all integers divisible by each r; fori = 1, 2,---,n. | 


From Definition 11.8 and our work on cyclic groups, we see that the lemofr;, r2,---, 
r, is the smallest positive integer that is a multiple of each r; fori = 1, 2,---,n, hence 
the name least common multiple. 


Let (a), a2,-*+,4n) € ]]}_, Gi. If a; is of finite order r; in G;, then the order of 
(ay, 42,°++,%)in We G; is equal to the least common multiple of all the 7;. 


This follows by a repetition of the argument used in the proof of Theorem 11.5. For a 
power of (a1, a2, +++, G) to give (e), €2,-++, @,), the power must simultaneously be a 
multiple of r; so that this power of the first component a, will yield e;, a multiple of r2, 
so that this power of the second component @ will yield e2, and so on. 5 


Find the order of (8, 4, 10) in the group Z;2 x Zeq x Zag. 


Since the gcd of 8 and 12 is 4, we see that 8 is of order B = 3 in Z>. (See Theorem 6.14.) 
Similarly, we find that 4 is of order 15 in Zo and 10 is of order 12 in Zo4. The Iem 
of 3, 15, and 12 is 3-5-4 = 60, so (8, 4, 10) is of order 60 in the group Z 12 x Ziggy x 
Zi. A 
The group Z x Z» is generated by the elements (1, 0) and (0, 1). More generally, the 
direct product of n cyclic groups, each of which is either Z or Z, for some positive 
integer m, is generated by the n n-tuples 


(1, 0, 0, ---, 0), (0, 1,0,--+, 0), (0,0, 1, +++, 0), wees. (0, 0,0, +--+, 1). 


Such a direct product might also be generated by fewer elements. For example, Z3 x 
Za xX 235 is generated by the single element (1, 1, 1). A 


Note that if []/_, G; is the direct product of groups G;, then the subset 
Gi = {(e1, 2, +++, Gi—15 Gi, i415 °° €n) [ai € Gi}, 


that is, the set of all n-tuples with the identity elements in all places but the ith, is a 
subgroup of []/_, Gi. Itis also clear that this subgroup G; is naturally isomorphic to G;; 
just rename 


(€1, €2, +++, €f—-15 Gi, Ci41,°°*» €n) by Qj. 
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The group G; is mirrored in the ith component of the elements of G;, and the e; in 
the other components just ride along. We consider []}_, G; to be the internal direct 
product of these subgroups G;. The direct product given by Theorem 11.2 is called the 
external direct product of the groups G;. The terms internal and external, as applied to 
a direct product of groups, just reflect whether or not (respectively) we are regarding the 
component groups as subgroups of the product group. We shall usually omit the words 
external and internal and just say direct product. Which term we mean will be clear from 


the context. 


# HistoricaL NOTE 


n his Disquisitiones Arithmeticae, Carl Gauss 

demonstrated various results in what is today the 
theory of abelian groups in the context of num- 
ber theory. Not only did he deal extensively with 
equivalence classes of quadratic forms, but he also 
considered residue classes modulo a given integer. 
Although he noted that results in these two areas 
were similar, he did not attempt to develop an ab- 
stract theory of abelian groups. 

In the 1840s, Ernst Kummer in dealing with 
ideal complex numbers noted that his results were in 
many respects analogous to those of Gauss. (See the 
Historical Note in Section 26.) But it was Kummer’s 
student Leopold Kronecker (see the Historical Note 
in Section 29) who finally realized that an abstract 


theory could be developed out of the analogies. As 
he wrote in 1870, “these principles [from the work 
of Gauss and Kummer} belong to a more general, 
abstract realm of ideas. It is therefore appropriate 
to free their development from all unimportant re- 
strictions, so that one can spare oneself from the 
necessity of repeating the same argument in differ- 
ent cases. This advantage already appears in the de- 
velopment itself, and the presentation gains in sim- 
plicity, if itis given in the most general admissible 
manner, since the most important features stand out 
with clarity.’ Kronecker then proceeded to develop 
the basic principles of the theory of finite abelian 
groups and was able to state and prove a version of 
Theorem 11.12 restricted to finite groups. 


ee 


11.12 Theorem 


The Structure of Finitely Generated Abelian Groups 


Some theorems of abstract algebra are easy to understand and use, although their proofs 
may be quite technical and time-consuming to present. This is one section in the text 
where we explain the meaning and significance of a theorem but omit its proof. The 
meaning of any theorem whose proof we omit is well within our understanding, and 
we feel we should be acquainted with it. It would be impossible for us to meet some of 
these fascinating facts in a one-semester course if we were to insist on wading through 
complete proofs of all theorems. The theorem that we now state gives us complete 
structural information about all sufficiently small abelian groups, in particular, about all 
finite abelian groups. 


(Fundamental Theorem of Finitely Generated Abelian Groups) Every finitely gen- 
erated abelian group G is isomorphic to a direct product of cyclic groups in the form 


LZ(p,y x Zipsy? Kr xK Zep, ¥" x ZX ZK x Z, 


Proof 


11.13 Example 


Solution 


11.14 Definition 


11.15 Theorem 


Proof 


11.16 Theorem 
Proof 
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where the p; are primes, not necessarily distinct, and the 7; are positive integers. The 
direct product is unique except for possible rearrangement of the factors; that is, the 
number (Betti number of G) of factors Z is unique and the prime powers (p,)" are 
unique. 


The proof is omitted here. + 


Find all abelian groups, up to isomorphism, of order 360. The phrase up to isomorphism 
signifies that any abelian group of order 360 should be structurally identical (isomorphic) 
to one of the groups of order 360 exhibited. 


We make use of Theorem 11.12. Since our groups are to be of the finite order 360, no 
factors Z will appear in the direct product shown in the statement of the theorem. 
First we express 360 as a product of prime powers 27375. Then using Theorem 11.12, 
we get as possibilities 
1 Bxhexhx2,;x2,~x Zs 
Zo x La x Dy x Ly x Bs 
Zo x Ly x Ly X Ly X Ls 
Zn Xx £4 x Zo x Bs 
Zg x Z3 x £3 x Ls 
Zg xX Ly x Zs 


AM Pw SN 


Thus there are six different abelian groups (up to isomorphism) of order 360. A 


Applications 


We conclude this section with a sampling of the many theorems we could now prove 
regarding abelian groups. 


A group G is decomposable if itis isomorphic to a direct product of two proper nontrivial 
subgroups. Otherwise G is indecomposable. a 


The finite indecomposable abelian groups are exactly the cyclic groups with order a 
power of a prime. 


Let G be a finite indecomposable abelian group. Then by Theorem 11.12, Gis isomorphic 
to a direct product of cyclic groups of prime power order. Since G is indecomposable, 
this direct product must consist of just one cyclic group whose order is a power of a 
prime number. 

Conversely, let p be a prime. Then Zp is indecomposable, for if Zp» were isomor- 
phic to Z, x Zp, where i+ j =r, then every element would have an order at most 
pray) < p. rs 


If m divides the order of a finite abelian group G, then G has a subgroup of order m. 
By Theorem 11.12, we can think of G as being 


Zipyi X Zipyr K +++ X Zep, yny 
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where not all primes p; need be distinct. Since (p1)"(p2)” --- (pay is the order of G, 
then m must be of the form (771) (p2)” «++ (pny, where 0 < s; < 7;. By Theorem 6.14, 
(p;y— generates a cyclic subgroup of Zip, of order equal to the quotient of (p;)" by 
the gcd of (p;)" and (p;)"—. But the ged of (p;)" and (p;)"~* is (pi yi, Thus (p,)" 
generates a cyclic subgroup Zcp,y; of order 
[pi 1/1 = a”. 
Recalling that (a) denotes the cyclic subgroup generated by a, we see that 
(pr) x (Cary?) x + x (pny) 
is the required subgroup of order m. ¢ 


If m is a square free integer, that is, m is not divisible by the square of any prime, then 
every abelian group of order m is cyclic. 


Let G be an abelian group of square free order m. Then by Theorem 11.12, G is isomor- 
phic to 

Zipy X Zcp,yr X +++ X Zipyyns 
where m = (p1)"' (p72)? --- (pn). Since m is square free, we must have all r; = | and 


all p; distinct primes. Corollary 11.6 then shows that G is isomorphic to Zp, p,...p,» 80 G 
is cyclic. 4 


@ EXERCISES 11 


Computations 


1. List the elements of Z, x Z4. Find the order of each of the elements. Is this group cyclic? 


2. Repeat Exercise 1 for the group Z3 x Zq4. 


In Exercises 3 through 7, find the order of the given element of the direct product. 


3. 
6. 
. What is the largest order among the orders of all the cyclic subgroups of Zs x Zs? of Zj2 x Zy5? 


(2, 6) in Z4 x Zy2 


(3, 10, 9) in Zs x 


4. (2,3)in Ze x Zy5 5. (8, 10) in Zi2 x Zig 
Zi2 x Zy5 oi G3, 6, 12, 16) in La x Z2 x Zi x Za 


. Find all proper nontrivial subgroups of Z2 x Zp. 

. Find all proper nontrivial subgroups of Z) x Zz x Zp. 

. Find all subgroups of Zz x Za of order 4. 

. Find all subgroups of Z x Z2 x Zz, that are isomorphic to the Klein 4-group. 


. Disregarding the order of the factors, write direct products of two or more groups of the form Z,, so that the 


resulting product is isomorphic to Zo in as many ways as possible. 


. Fill in the blanks. 


a. The cyclic subgroup of Z24 generated by 18 has order. 
b. Z3 x Za is of order__. 


15. 
16. 
17. 
18. 
19. 
20. 
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c. The element (4, 2) of Zj2 x Zg has order_. 

d. The Klein 4-group is isomorphic to Z__ x Z_. 

e. Z) x Z x Z4 has__elements of finite order. 

Find the maximum possible order for some element of Z4 x Ze. 

Are the groups Z2 x Z2 and Z4 x Ze isomorphic? Why or why not? 

Find the maximum possible order for some element of Zs x Zio x Zz. 

Are the groups Zg x Zio X Zo4 and Zy x Zi2 X Zao isomorphic? Why or why not? 
Find the maximum possible order for some element of Z, x Zig x Zs. 


Are the groups Z4 x Zig x Zs and Z3 x Z36 x Zio isomorphic? Why or why not? 


In Exercises 21 through 25, proceed as in Example 11.13 to find all abelian groups, up to isomorphism, of the given 
order. 


21. 
24. 


26. 
27. 


28. 
29, 


30. 
31. 


Order 8 22. Order 16 23. Order 32 
Order 720 25. Order 1089 


How many abelian groups (up to isomorphism) are there of order 24? of order 25? of order (24)(25)? 


Following the idea suggested in Exercise 26, let m and a be relatively prime positive integers. Show that if 
there are (up to isomorphism) r abelian groups of order m and s of order n, then there are (up to isomorphism) 
rs abelian groups of order mn. 


Use Exercise 27 to determine the number of abelian groups (up to isomorphism) of order (10). 


a. Let p be a prime number. Fill in the second row of the table to give the number of abelian groups of order p”, 
up to isomorphism. 


number of groups [| {| = | [ [| | 


b. Let p,q, andr be distinct prime numbers. Use the table you created to find the number of abelian groups, 
up to isomorphism, of the given order. 
i. peqtr? ii. (gr)! fii, g>r4g? 
Indicate schematically a Cayley digraph for Z,, x Z, for the generating set S = {(1, 0), (0, 1)}. 
Consider Cayley digraphs with two arc types, a solid one with an arrow and a dashed one with no arrow, 
and consisting of two regular n-gons, for n > 3, with solid arc sides, one inside the other, with dashed arcs 
joining the vertices of the outer n-gon to the inner one. Figure 7.9(b) shows such a Cayley digraph with n = 3, 
and Figure 7.11(b) shows one with » = 4. The arrows on the outer n-gon may have the same (clockwise or 
counterclockwise) direction as those on the inner n-gon, or they may have the opposite direction. Let G be a 
group with such a Cayley digraph. 


a. Under what circumstances will G be abelian? 

b. If G is abelian, to what familiar group is it isomorphic? 

c. If G is abelian, under what circumstances is it cyclic? 

d. If G is not abelian, to what group we have discussed is it isomorphic? 
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32. Mark each of the following true or false. 
a. If G; and G2 are any groups, then G; x G2 is always isomorphic to G2 x G1. 
b. Computation in an external direct product of groups is easy if you know how to compute in each 
component group. 
c. Groups of finite order must be used to form an external direct product. 
d. A group of prime order could not be the internal direct product of two proper nontrivial subgroups. 
e. Z. x Zq is isomorphic to Zs. 
f. Z, x Z4 is isomorphic to Ss. 
g. Z; x Zs is isomorphic to S4. 
h. Every element in Z4 x Zg has order 8. 
i. The order of Z)2 x Zs is 60. 
je Zm X Zn has mn elements whether m and n are relatively prime or not. 
33. Give an example illustrating that not every nontrivial abelian group is the internal direct product of two proper 
nontrivial subgroups. 
34, a. How many subgroups of Zs x Ze are isomorphic to Zs x Ze? 
b. How many subgroups of Z x Z are isomorphic to Z x Z? 
35. Give an example of a nontrivial group that is not of prime order and is not the internal direct product of two 
nontrivial subgroups. 
36. Mark each of the following true or false. 
a. Every abelian group of prime order is cyclic. 
pb. Every abelian group of prime power order is cyclic. 
c. Zg is generated by {4, 6}. 
d. Zg is generated by {4, 5, 6}. 
e. All finite abelian groups are classified up to isomorphism by Theorem 11.12. 
f. Any two finitely generated abelian groups with the same Betti number are isomorphic. 
g. Every abelian group of order divisible by 5 contains a cyclic subgroup of order 5. 
h. Every abelian group of order divisible by 4 contains a cyclic subgroup of order 4. 
i. Every abelian group of order divisible by 6 contains a cyclic subgroup of order 6. 
j. Every finite abelian group has a Betti number of 0. 
37. Let p and q be distinct prime numbers. How does the number (up to isomorphism) of abelian groups of order p’ 
compare with the number (up to isomorphism) of abelian groups of order q”? 
38. Let G be an abelian group of order 72. 
a. Can you say how many subgroups of order 8 G has’? Why, or why not? 
b. Can you say how many subgroups of order 4 G has? Why, or why not? 
39. Let G be an abelian group. Show that the elements of finite order in G forma subgroup. This subgroup is called 


the torsion subgroup of G. 


Exercises 40 through 43 deal with the concept of the torsion subgroup just defined. 


40. 


Find the order of the torsion subgroup of Z4 x Z x Z3; of Zin X ZX Zy0. 
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41. Find the torsion subgroup of the multiplicative group R* of nonzero real numbers. 
42. Find the torsion subgroup T of the multiplicative group C* of nonzero complex numbers. 


43, An abelian group is torsion free if ¢ is the only element of finite order. Use Theorem 11.12 to show that 
every finitely generated abelian group is the internal direct product of its torsion subgroup and of a torsion-free 
subgroup. (Note that {e} may be the torsion subgroup, and is also torsion free.) 


44. The part of the decomposition of G in Theorem 11.12 corresponding to the subgroups of prime-power order 
can also be written in the form Zm, x Zm, X +++ X Zm,, where m; divides mj; fori = 1,2,---,7 — 1. The 
numbers m; can be shown to be unique, and are the torsion coefficients of G. 


a. Find the torsion coefficients of Z4 x Zo. 
b. Find the torsion coefficients of Ze x Zi2 x Zoo. 
c. Describe an algorithm to find the torsion coefficients of a direct product of cyclic groups. 


Proof Synopsis 


45. Give a two-sentence synopsis of the proof of Theorem 11.5. 


Theory 


46. Prove that a direct product of abelian groups is abelian. 


47. Let G be an abelian group. Let H be the subset of G consisting of the identity e together with all elements of 
G of order 2. Show that H is a subgroup of G. 

48. Following up the idea of Exercise 47 determine whether H will always be a subgroup for every abelian group 
G if H consists of the identity e together with all elements of G of order 3; of order 4. For what positive 
integers n will H always be a subgroup for every abelian group G, if H consists of the identity e together with 
all elements of G of order n? Compare with Exercise 48 of Section 5. 


49, Find a counterexample of Exercise 47 with the hypothesis that G is abelian omitted. 


Let H and K be subgroups of a group G. Exercises 50 and 51 ask you to establish necessary and sufficient criteria 
for G to appear as the internal direct product of H and K. 


50. Let H and K be groups and let G = H x K. Recall that both H and K appear as subgroups of G ina natural 
way. Show that these subgroups H (actually H x {e}) and K (actually {e} x K) have the following properties. 


a. Every element of G is of the form hk for some h € H andk € K. 
b. Ak =kh foralh ¢ Handke K. e« HOOK = {e}. 
51. Let H and K be subgroups of a group G satisfying the three properties listed in the preceding exercise. Show 


that for each g € G, the expression g = hk forh € H andk ¢€ K is unique. Then let each g be renamed (h, k). 
Show that, under this renaming, G becomes structurally identical (isomorphic) to H x K. 


52. Show that a finite abelian group is not cyclic if and only if it contains a subgroup isomorphic to Z, x Zp for 
some prime p. 

53. Prove that if a finite abelian group has order a power of a prime p, then the order of every element in the group 
is a power of p. Can the hypothesis of commutativity be dropped? Why, or why not? 


54. Let G, H, and K be finitely generated abelian groups. Show that if G x K is isomorphic to H x K, then 
G~ dH. 
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T PLANE ISOMETRIES 


Consider the Euclidean plane R?. An isometry of IR? is a permutation ¢ : R’ > R? 
that preserves distance, so that the distance between points P and Q is the same as 
the distance between the points @(P) and ¢(Q) for all points P and Q in R?. If w is 
also an isometry of R*, then the distance between y(@(P)) and w(@(Q)) must be the 
same as the distance between #(P) and @(Q), which in turn is the distance between P 
and Q, showing that the composition of two isometries is again an isometry. Since the 
identity map is an isometry and the inverse of an isometry is an isometry, we see that the 
isometries of R? form a subgroup of the group of all permutations of R’. 

Given any subset S of R?, the isometries of R® that carry S onto itself form a 
subgroup of the group of isometries. This subgroup is the group of symmetries of S in 
IR2. In Section 8 we gave tables for the group of symmetries of an equilateral triangle 
and for the group of symmetries of a square in R’. 

Everything we have defined in the two preceding paragraphs could equally well 
have been done for n-dimensional Euclidean space R”, but we will concern ourselves 
chiefly with plane isometries here. 

It can be proved that every isometry of the plane is one of just four types (see Artin 
[5]). We will list the types and show, for each type, a labeled figure that can be carried 
into itself by an isometry of that type. In each of Figs. 12.1, 12.3, and 12.4, consider the 
line with spikes shown to be extended infinitely to the left and to the right. We also give 
an example of each type in terms of coordinates. 


translation t: Slide every point the same distance in the same direction. See 
Fig. 12.1. (Example: t(x, y) = (x,y) + 2, -3) = @ + 2,y—3)) 

rotation p: Rotate the plane about a point P through an angle @. See Fig. 12.2. 
(Example: p(x, y) = (—y, x) is a rotation through 90° counterclockwise about the 
origin (0, 0).) 

reflection 4: Map each point into its mirror image (4 for mirror) across a line 
L, each point of which is left fixed by jz. See Fig. 12.3. The line L is the axis of 
reflection. (Example: (x, y) = (y, x) is a reflection across the line y = x.) 


glide reflection y: |The product of a translation and a reflection across a line mapped 
into itself by the translation. See Fig. 12.4. (Example: y(x, y) =(« +4, —y) isa 
glide reflection along the x-axis.) 


Notice the little curved arrow that is carried into another curved arrow in each of 
Figs. 12.1 through 12.4. For the translation and rotation, the counterclockwise directions 
of the curved arrows remain the same, but for the reflection and glide reflection, the 
counterclockwise arrow is mapped into a clockwise arrow. We say that translations and 
rotations preserve orientation, while the reflection and glide reflection reverse orien- 
tation. We do not classify the identity isometry as any definite one of the four types 
listed; it could equally well be considered to be a translation by the zero vector or a 
rotation about any point through an angle of 0°. We always consider a glide reflection to 
be the product of a reflection and a translation that is different from the identity isometry. 


+ This section is not used in the remainder of the text. 
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tr (0) Q (Q) 
12.1 Figure Translation t. 12.2 Figure Rotation p. 
P HQ) oR 


voy \ 


uP) QO wR) ee) F v(P) 


12.3 Figure Reflection pi. 12.4 Figure Glide reflection y. 


The theorem that follows describes the possible structures of finite subgroups of the 
full isometry group. 


12.5 Theorem Every finite group G of isometries of the plane is isomorphic to either Z, or to a dihedral 
group D, for some positive integer n. 


Proof Outline First we show that there is a point P in the plane that is left fixed by every isometry 
in G. This can be done in the following way, using coordinates in the plane. Suppose 
G = {o. b2. +++, Gm} and let 
(x7. Yi) = G7 (0, 0). 
Then the point 


P=GP= (= Haat tam 3 Lets) 
m m 

is the centroid of the set S = {(x;. y;)|i = 1.2. ---.m}. The isometries in G permute 
the points in S among themselves, since if ¢¢; = ¢, then $;(4;, yj) = @ {6;(0, 0] = 
(0, 0) = (xg, yx). It can be shown that the centroid of a set of points is uniquely 
determined by its distances from the points, and since each isometry in G just permutes 
the set S, it must leave the centroid (x, ¥) fixed. Thus G consists of the identity, rotations 
about P, and reflections across a line through P. 

The orientation-preserving isometries in G form a subgroup AH of G which is either 
all of G or of order m/2. This can be shown in the same way that we showed that the 
even permutations are a subgroup of S, containing just half the elements of S,,. (See 
Exercise 22.) Of course H consists of the identity and the rotations in G. If we choose a 
rotation in G that rotates the plane through as small an angle @ > 0 as possible, it can be 
shown to generate the subgroup H. (See Exercise 23.) This shows that if H = G, then 


G is cyclic of order m and thus isomorphic to Z,,. Suppose H + G so that G contains 
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some reflections. Let H = {1, 01, -++, Pn»—1} withn = m/2. If wis areflection in G, then 
the coset Hs consists of all n of the reflections in G. 

Consider now a regular n-gon in the plane having P as its center and with a vertex 
lying on the line through P left fixed by «. The elements of H rotate this n-gon through 
all positions, and the elements of Hy first reflect in an axis through a vertex, effectively 
turning the m-gon over, and then rotate through all positions. Thus the action of G on 
this n-gon is the action of D,,, so G is isomorphic to D,,. Sd 


The preceding theorem gives the complete story about finite plane isometry groups. 
We turn now to some infinite groups of plane isometries that arise naturally in decorating 
and art. Among these are the discrete frieze groups. A discrete frieze consists of a pattern 
of finite width and height that is repeated endlessly in both directions along its baseline 
to form a strip of infinite length but finite height; think of it as a decorative border strip 
that goes around a room next to the ceiling on wallpaper. We consider those isometries 
that carry each basic pattern onto itself or onto another instance of the pattern in the 
frieze. The set of all such isometries is called the “frieze group.” All discrete frieze 
groups are infinite and have a subgroup isomorphic to Z generated by the translation 
that slides the frieze lengthwise until the basic pattern is superimposed on the position 
of its next neighbor pattern in that direction. As a simple example of a discrete frieze, 
consider integral signs spaced equal distances apart and continuing infinitely to the left 
and right, indicated schematically like this. 


TG > 


Let us consider the integral signs to be one unit apart. The symmetry group of this frieze 
is generated by a translation t sliding the plane one unit to the right, and by a rotation ¢ 
of 180° about a point in the center of some integral sign. There are no horizontal or 
vertical reflections, and no glide reflections. This frieze group is nonabelian; we can 
check that to = pt~!. The n-th dihedral group D, is generated by two elements that 
do not commute, a rotation 9, through 360/n° of order m and a reflection w of order 
2 satisfying pi = Me, ! Thus it is natural to use the notation Do. for this nonabelian 
frieze group generated by t of infinite order and p of order 2. 
As another example, consider the frieze given by an infinite string of D’s. 


-- DDDDDDDDDDD.--: 


Its group is generated by a translation t one step to the right and by a vertical reflection yz 
across a horizontal line cutting through the middle of all the D’s. We can check that these 
group generators commute this time, that is, Tu = jt, so this frieze group is abelian 
and is isomorphic to Z x Zp. 

It can be shown that if we classify such discrete friezes only by whether or not their 
groups contain a 


rotation horizontal axis reflection 
vertical axis reflection nontrivial glide reflection 


then there are a total of seven possibilities. A nontrivial glide reflection in a symmetry 
group is one that is not equal to a product of a translation in that group and a reflection 
in that group. The group for the string of D’s above contains glide reflections across 
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the horizontal line through the centers of the D’s, but the translation component of each 
glide reflection is also in the group so they are all considered trivial glide reflections in 
that group. The frieze group for 


.. D D D D D. « 
tae, JD D D D eee 


contains a nontrivial glide reflection whose translation component is not an element of 
the group. The exercises exhibit the seven possible cases, and ask you to tell, for each 
case, which of the four types of isometries displayed above appear in the symmetry 
group. We do not obtain seven different group structures. Each of the groups obtained 
can be shown to be isomorphic to one of 


Z, Do, 2x, or Dy xh. 


Equally interesting is the study of symmetries when a pattern in the shape of a square, 
parallelogram, rhombus, or hexagon is repeated by translations along two nonparallel 
vector directions to fill the entire plane, like patterns that appear on wallpaper. These 
groups are called the wallpaper groups or the plane crystallographic groups. While a 
frieze could not be carried into itself by a rotation through a positive angle less than 
180°, it is possible to have rotations of 60°, 90°, 120°, and 180° for some of these 
plane-filling patterns. Figure 12.6 provides an illustration where the pattern consists of 
a square. We are interested in the group of plane isometries that carry this square onto 
itself or onto another square. Generators for this group are given by two translations 
(one sliding a square to the next neighbor to the right and one to the next above), by a 
rotation through 90° about the center of a square, and by a reflection in a vertical (or 
horizontal) line along the edges of the square. The one reflection is all that is needed to 
“urn the plane over”; a diagonal reflection can also be used. After being turned over, 
the translations and rotations can be used again. The isometry group for this periodic 
pattern in the plane surely contains a subgroup isomorphic to Z x Z generated by the 
unit translations to the right and upward, and a subgroup isomorphic to D4 generated by 
those isometries that carry one square (it can be any square) into itself. 

If we consider the plane to be filled with parallelograms as in Fig. 12.7, we do not 
get all the types of isometries that we did for Fig. 12.6. The symmetry group this time is 


12.6 Figure 
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12.7 Figure 


generated by the translations indicated by the arrows and a rotation through 180° about 
any vertex of a parallelogram. 

It can be shown that there are 17 different types of wallpaper patterns when they are 
classified according to the types of rotations, reflections, and nontrivial glide reflections 
that they admit. We refer you to Gallian [8] for pictures of these 17 possibilities and 
a chart to help you identify them. The exercises illustrate a few of them. The situation 
in space is more complicated; it can be shown that there are 930 three-dimensional 
crystallographic groups. The final exercise we give involves rotations in space. 

M. C. Escher (1898-1973) was an artist whose work included plane-filling patterns. 
The exercises include reproductions of four of his works of this type. 


mg EXERCISES 12 


1. This exercise shows that the group of symmetries of a certain type of geometric figure may depend on the 
dimension of the space in which we consider the figure to lie. 
a. Describe all symmetries of a point in the real line R; that is, describe all isometries of R that leave one point 

fixed. 

b. Describe all symmetries (translations, reflections, etc.) of a point in the plane R’. 
c. Describe all symmetries of a line segment in R. 
d. Describe all symmetries of a line segment in R’. 
e, Describe some symmetries of a line segment in R°. 


2. Let P stand for an orientation preserving plane isometry and R for an orientation reversing one. Fill in the table 
with P or R to denote the orientation preserving OT reversing properly of a product. 


P\R 
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3. Fill in the table to give all possible types of plane isometries given by a product of two types. For example, a 
product of two rotations may be a rotation, or it may be another type. Fill in the box corresponding to pp with 
both letters. Use your answer to Exercise 2 to eliminate some types. Eliminate the identity from consideration. 


4. Draw a plane figure that has a one-element group as its group of symmetries in R?. 

5, Draw a plane figure that has a two-element group as its group of symmetries in R?. 

6. Draw a plane figure that has a three-element group as its group of symmetries in R’. 

7, Draw a plane figure that has a four-element group isomorphic to Z4 as its group of symmetries in R?. 

8. Draw a plane figure that has a four-element group isomorphic to the Klein 4-group V as its group of symmetries 
in R?. 

9, For each of the four types of plane isometries (other than the identity), give the possibilities for the order of an 
isometry of that type in the group of plane isometries. 


10. A plane isometry ¢ has a fixed point if there exists a point P in the plane such that #(P) = P. Which of the 
four types of plane isometries (other than the identity) can have a fixed point? 


11. Referring to Exercise 10, which types of plane isometries, if any, have exactly one fixed point? 

12. Referring to Exercise 10, which types of plane isometries, if any, have exactly two fixed points? 

13. Referring to Exercise 10, which types of plane isometries, if any, have an infinite number of fixed points? 

14. Argue geometrically that a plane isometry that leaves three noncolinear points fixed must be the identity map. 


15. Using Exercise 14, show algebraically that if two plane isometries ¢ and w agree on three noncolinear points, 
that is, if o(P;) = W(P;) for noncolinear points P;, Po, and P3, then ¢ and w are the same map. 


16. Do the rotations, together with the identity map, form a subgroup of the group of plane isometries? Why or why 
not? 

17. Do the translations, together with the identity map, form a subgroup of the group of plane isometries? Why or 
why not? 

18. Do the rotations about one particular point P, together with the identity map, form a subgroup of the group of 
plane isometries? Why or why not? 


19. Does the reflection across one particular line L, together with the identity map, form a subgroup of the group 
of plane isometries? Why or why not? 


20. Do the glide reflections, together with the identity map, form a subgroup of the group of plane isometries? Why 
or why not? 

21. Which of the four types of plane isometries can be elements of a finite subgroup of the group of plane isometries? 

22. Completing a detail of the proof of Theorem 12.5, let G be a finite group of plane isometries. Show that 


the rotations in G, together with the identity isometry, form a subgroup H of G, and that either H = G or 
|G| = 2|A|. [Hint: Use the same method that we used to show that |S,,| = 2|A,|-] 
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23. Completing a detail in the proof of Theorem 12.5, let G be a finite group consisting of the identity isometry 
and rotations about one point P in the plane. Show that G is cyclic, generated by the rotation in G that turns 
the plane counterclockwise about P through the smallest angle 6 > 0. [Hint: Follow the idea of the proof that 
a subgroup of a cyclic group is cyclic.] 


Exercises 24 through 30 illustrate the seven different types of friezes when they are classified according to their 
symmetries. Imagine the figure shown to be continued infinitely to the right and left. The symmetry group of a 
frieze always contains translations. For each of these exercises answer these questions about the symmetry group 
of the frieze. 

. Does the group contain a rotation? 

. Does the group contain a reflection across a horizontal line? 

. Does the group contain a reflection across a vertical line? 

. Does the group contain a nontrivial glide reflection? 


on fn & f 


. To which of the possible groups Z, Doo, Z xX Zo, or Doo X Zz do you think the symmetry group of the 
frieze is isomorphic? 


4. FFFFFFFFFFFFFFF 
2. TTTTTTTITTT 
~» EEEEKEEEEEEE 


ry. UR . 
xy oa 
4 

x : : 
Py ae 

A / 

ie 
a as 
‘Ge - 2 
yO 6 
Fe 


ned 


12.8 Figure The Study of Regular Division of the Plane with Horsemen (© 1946 M.C. 
Escher Foundation—-Baarn—Holland. All rights reserved.) 
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Exercises 31 through 37 describe a pattern to be used to fill the plane by translation in the two directions given by 
the specified vectors. Answer these questions in each case. 


a. Does the symmetry group contain any rotations? If so, through what possible angles @ where 0 < 6 < 


180°? 
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an 
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12.9 Figure The Study of Regular Division of the Plane with Imaginary Human Figures (© 
1936 M. C. Escher Foundation—Baarn—Holland. All rights reserved.) 
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31. 
32. 
33. 


34. 


35. 


36. 
37. 
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12.10 Figure The Study of Regular Division of the Plane with Reptiles (© 1939 M. C. Escher 
Foundation—Baarn—Holland. All rights reserved.) 


b. Does the symmetry group contain any reflections? 


c. Does the symmetry group contain any nontrivial glide reflections? 


A square with horizontal and vertical edges using translation directions given by vectors (1, 0) and (0, 1). 

A square as in Exercise 31 using translation directions given by vectors (1, 1/2) and (0, 1). 

A square as in Exercise 31 with the letter L at its center using translation directions given by vectors (1, 0) and 
(0, 1). 

A square as in Exercise 31 with the letter E at its center using translation directions given by vectors (1, 0) and 
(0, 1). 

A square as in Exercise 31 with the letter H at its center using translation directions given by vectors (1, 0) and 
(0, 1). 

A regular hexagon with a vertex at the top using translation directions given by vectors (1, 0) and (1, V3). 

A regular hexagon with a vertex at the top containing an equilateral triangle with vertex at the top and centroid 
at the center of the hexagon, using translation directions given by vectors (1, 0) and (1, V3). 


Exercises 38 through 41 are concerned with art works of M. C. Escher. Neglect the shading in the figures 
and assume the markings in each human figure, reptile, or horseman are the same, even though they may be 
invisible due to shading. Answer the same questions (a), (b), and (c) that were asked for Exercises 31 through 
36, and also answer this part (d). 


d. Assuming horizontal and vertical coordinate axes with equal scales as usual, give vectors in the two nonpar- 
allel directions of vectors that generate the translation subgroup. Do not concern yourself with the length 
of these vectors. 
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12.11 Figure The Study of Regular Division of the Plane with Human Figures (© 1936 M. C. 
Escher Foundation—Baarn—Holland. All rights reserved.) 


38. The Study of Regular Division of the Plane with Horsemen in Fig. 12.8. 

39. The Study of Regular Division of the Plane with Imaginary Human Figures in Fig. 12.9. 
40. The Study of Regular Division of the Plane with Reptiles in Fig. 12.10. 

41. The Study of Regular Division of the Plane with Human Figures in Fig. 12.11. 


42. Show that the rotations of a cube in space form a group isomorphic to S4. [Hint: A rotation of the cube permutes 
the diagonals through the center of the cube.] 
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HoOMOMORPHISMS 


Structure-Relating Maps 


Let G and G’ be groups. We are interested in maps from G to G’ that relate the group 
structure of G to the group structure of G’. Such a map often gives us information 
about one of the groups from known structural properties of the other. An isomorphisnt 
o:G — G’, if one exists, is an example of such a structure-relating map. If we know all 
about the group G and know that ¢ is an isomorphism, we immediately know all about 
the group structure of G’, for it is structurally just a copy of G. We now consider more 
general structure-relating maps, weakening the conditions from those of an isomorphism 
by no longer requiring that the maps be one to one and onto. You see, those conditions are 
the purely set-theoretic portion of our definition of an isomorphism, and have nothing to 
do with the binary operations of G and of G’. The binary operations are what give us the 
algebra which is the focus of our study in this text. We keep just the homomorphism prop- 
erty of an isomorphism related to the binary operations for the definition we now make. 


A map @ of a group G into a group G’ is a homomorphism if the homomorphism 
property 


d(ab) = $(a)o) (1) 
holds for alla, b € G. | 


* Section 16 is a prerequisite only for Sections 17 and 36. 
* Section 17 is not required for the remainder of the text. 
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13.2 Example 


13.3 Example 


Solution 


13.4 Example 


Homomorphisms and Factor Groups 


Let us now examine the idea behind the requirement (1) for a homomorphism 
¢: G — G'. In Eq. (1), the product ab on the left-hand side takes place in G, while the 
product $(a)(b) on the right-hand side takes place in G’. Thus Eq. (1) gives a relation 
between these binary operations, and hence between the two group structures. 

For any groups G and G’, there is always at least one homomorphism @ : G > G’, 
namely the trivial homomorphism defined by $(g) = e’ for all g € G, where e’ is the 
identity in G’. Equation (1) then reduces to the true equation e’ = e’e’. No information 
about the structure of G or G’ can be gained from the other group using this trivial 
homomorphism. We give an example illustrating how a homomorphism @ mapping G 
onto G’ may give structural information about G’. 


Let d : G > G’ be agroup homomorphism of G onto G’. We claim that if G is abelian, 
then G’ must be abelian. Let a’, b’ € G’. We must show that a’b’ = b’a’. Since ¢ is 
onto G’, there exist a,b € G such that d(a) =a’ and @(b) = 0’. Since G is abelian, 
we have ab = ba. Using property (1), we have a’b’ = $(a)o(b) = b(ab) = O(ba) = 
¢(b)(a) = b’a', so G’ is indeed abelian? A 


Example 13.16 will give an illustration showing how information about G’ may 
give information about G via a homomorphism ¢ : G > G’. We now give examples of 
homomorphisms for specific groups. 


Let S,, be the symmetric group on n letters, and let @ : 5, > Zo be defined by 
so= 0 if is an even permutation, 
°)~)1~ if o is an odd permutation. 
Show that ¢ is a homomorphism. 


We must show that ¢(o 2) = é(o) + o() for all choices of o, 4 € S,. Note that the 
operation on the right-hand side of this equation is written additively since it takes place 
in the group Z. Verifying this equation amounts to checking just four cases: 

o odd and yu odd, 

o odd and p even, 

o even and yz odd, 


o even and y even. 


Checking the first case, if o and yz can both be written as a product of an odd number of 
transpositions, then o 4 can be written as the product of an even number of transpositions. 
Thus o(o 2) = 0 and (co) + o(4) = 1+ 1 = 0 in Zp. The other cases can be checked 
similarly. A 


(Evaluation Homomorphism) Let F be the additive group of all functions mapping 
R into R, let R-be the additive group of real numbers, and let c be any real number. Let 
dé: : F —> R be the evaluation homomorphism defined by ¢.(f) = f(c) for f € F. 
Recall that, by definition, the sum of two functions f and g is the function f + g whose 
value at x is f(x) + g(x). Thus we have 


b(f +g) =(F + EO) = fle) + 8) = be(f) + bc(8), 


and Eq. (1) is satisfied, so we have a homomorphism. A 


13.5 Example 


13.6 Example 


13.7 Example 


13.8 Example 


13.9 Example 


13.10 Example 


Solution 
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Let R" be the additive group of column vectors with n real-number components. (This 
group is of course isomorphic to the direct product of RR under addition with itself for 
n factors.) Let A be an m x n matrix of real numbers. Let ¢ : R” ~ R” be defined by 
o(v) = Av foreach column vector v € R". Then ¢ is a homomorphism, since for v, w € 
IR”, matrix algebra shows that @(v + w) = A(v + w) = Av + Aw = @(v) + ¢(w). In 
linear algebra, such a map computed by multiplying a column vector on the left by a 
matrix A is known as a linear transformation. A 


Let GL(n, R) be the multiplicative group of all invertible n x n matrices. Recall that a 
matrix A is invertible if and only if its determinant, det(A), is nonzero. Recall also that 
for matrices A, B € GL(n, R) we have 


det(AB) = det(A) det(B). 


This means that det is ahomomorphism mapping GL(n, R) into the multiplicative group 
R* of nonzero real numbers. A 


Homomorphisms of a group G into itself are often useful for studying the structure 
of G. Our next example gives a nontrivial homomorphism of a group into itself. 


Letr € Zand let o, : Z > Z be defined by ¢,(n) = rn for alln € Z. Forallm,n € Z, 
we have ¢-(m +n) =r(m +n) =rm+rn = >,(m) + ¢,(n) so @, isahomomorphism. 
Note that @p is the trivial homomorphism, ¢; is the identity map, and @_; maps Z onto 
Z. For all other r in Z, the map @, is not onto Z. A 


LetG = G; x G) xX -+- x G; x -++ x G,, beadirect product of groups. The projection 
map 7; : G > G; where 7;(g1. g2.--+>. 8i.°- +» Sn) = g; ts a homomorphism for each 
i = 1,2,---,n. This follows immediately from the fact that the binary operation of G 
coincides in the ith component with the binary operation in G;. A 


Let F be the additive group of continuous functions with domain [0, 1] and let R be the 
additive group of real numbers. The map o : F > R defined by a(f) = f- : fOddx for 
f €F is a homomorphism, for 


1] I 
5G os i (epawe= | LF) + g@ldx 


1 1 
= , pended / OA Lee 
0 0 
forall f,g EF. A 


(Reduction Modulo n) Let y be the natural map of Z into Z, given by y(m) =r, 
where r is the remainder given by the division algorithm when m is divided by n. Show 
that y is a homomorphism. 


We need to show that 
yo+tn=yo+rvO 
for s,¢ € Z. Using the division algorithm, we let 


s=qntn (2) 
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and 
t=qn+h2 (3) 
where 0 <7; < n fori = 1,2. If 
nth =gnt+rs (4) 
for 0 <r3 <n, then adding Eqs. (2) and (3) we see that 
str=(qatatgamnt+rs, 


so that y(s +f) = 73. 

From Egs. (2) and (3) we see that y(s) = 7; and y(t) = 72. Equation (4) shows that 
the sum r, +1 in Z, is equal to rz also. 

Consequently y(s +t) = y(s) + y(t), so we do indeed have a homomorphism. & 


Each of the homeomorphisms in the preceding three examples is a many-to-one map. 
That is, different points of the domain of the map may be carried into the same point. 
Consider, for illustration, the homomorphism 7; : Zz x Z4 > Zy in Example 13.8 We 
have 


71(0, 0) = 11 (0, 1) = 71(0, 2) = 71(0, 3) = 0, 


so four elements in Z. x Z4 are mapped into 0 in Z, by 7. 

Composition of group homomorphisms is again a group homomorphism. That is, if 
o:G— G’and y : G’ > G" are both group homomorphisms then their composition 
(yo): G—> G", where (vy 0 ¢)(g) = y(G(g)) for g € G, is also a homomorphism. 
(See Exercise 49.) 


Properties of Homomorphisms 


We turn to some structural features of G and G’ that are preserved by a homomorphism 
6: G > G’. First we review set-theoretic definitions. Note the use of square brackets 
when we apply a function to a subset of its domain. 


Let @ be a mapping of a set X into a set Y, and let A C X and B C Y. The image ¢[A] 
of Ain Y under @ is {¢(a)|a € A}. The set d[X] is the range of ¢. The inverse image 
@—'[B] of B in X is {x € X | d(x) € B}. | 


The first three properties of ahomomorphism stated in the theorem that follows have ° 
already been encountered for the special case of an isomorphism; namely, in Theorem 
3.14, Exercise 28 of Section 4, and Exercise 41 of Section 5. There they were really 
obvious because the structures of G and G’ were identical. We will now see that they 
hold for structure-relating maps of groups, even if the maps are not one to one and onto. 
We do not consider them obvious in this new context. 


Let @ be a homomorphism of a group G into a group G’. 


1. Ife is the identity element in G, then ((e) is the identity element e’ in G’. 
2. Ifa eG, then d(a7!) = 6a). 


Proof 


13.13 Definition 
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3. If H isa subgroup of G, then ¢[H] is a subgroup of G’. 
4, If K’ isa subgroup of G’, then @~![K’] is a subgroup of G. 


Loosely speaking, ¢ preserves the identity element, inverses, and subgroups. 


Let @ be a homomorphism of G into G’. Then 
g(a) = dae) = d(ag(e). 


Multiplying on the left by 6(a)~', we see that e’ = ¢(e). Thus ¢(e) must be the identity 
element e’ in G’. The equation 


e = o(e) = $(aa“') = ¢(a)o(a"') 


shows that d(a~!) = ¢(a)"}. 

Turning to Statement (3), let H be a subgroup of G, and let d(a) and ¢(b) be any 
two elements in ¢[H]. Then d(a)o(b) = G(ab), so we see that f(a)o(b) € ¢[ #7]; thus, 
$[H] is closed under the operation of G’. The fact that e’ = ¢(e) and ¢(a~!) = d(a)“! 
completes the proof that ¢[H] is a subgroup of G’. 

Going the other way for Statement (4), let K’ be a subgroup of G’. Suppose a and b are 
ing@—'[K’]. Then ¢(a)¢(b) € K’ since K' is asubgroup. The equation (ab) = ¢(a)o(b) 
shows that ab € ¢—'[K’]. Thus @~'[K’] is closed under the binary operation in G. Also, 
K’ must contain the identity element e’ = (e), so e € @7![K’]. If a € @ | [K’], then 
¢(a) € K', so ¢(a)~! € K’. But ¢(a)~! = ¢(a7!), so we must have a7! € 7![K’]. 
Hence ¢~![K’] is a subgroup of G. ° 


Let @ : G > G’ be a homomorphism and let e’ be the identity element of G’. Now 
{e’} is a subgroup of G’, so ¢~[{e’}] is a subgroup H of G by Statement (4) in Theorem 
13.12. This subgroup is critical to the study of homomorphisms. 


Let ¢:G—G’ be a homomorphism of groups. The subgroup @¢7'[{e’}] = 
{x € G| (x) = e’} is the kernel of ¢, denoted by Ker(@). z 


Example 13.5 discussed the homomorphism ¢ : R” > R” given by ¢(v) = Av 
where A is an m X n matrix. In this context, Ker() is called the null space of A. It 
consists of all v € R” such that Av = 0, the zero vector. 

Let H = Ker(¢) for a homomorphism ¢ : G — G’. We think of ¢ as “collapsing” 
H down onto e’. Theorem 13.15 that follows shows that for g € G, the cosets gH 
and Hg are the same, and are collapsed onto the single element @(g) by ¢. That is 
é—'[{(g)}] = gH = Hg. (Be sure that you understand the reason for the uses of (), [], 
and {} in @—'[{6(g)}].) We have attempted to symbolize this collapsing in Fig. 13.14, 
where the shaded rectangle represents G, the solid vertical line segments represent the 
cosets of H = Ker(@), and the horizontal line at the bottom represents G’. We view 
@ as projecting the elements of G, which are in the shaded rectangle, straight down 
onto elements of G’, which are on the horizontal line segment at the bottom. Notice 
the downward arrow labeled ¢ at the left, starting at G and ending at G’. Elements of 
H = Ker(@) thus lie on the solid vertical line segment in the shaded box lying over e’, 
as labeled at the top of the figure. 
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13.14 Figure Cosets of H collapsed by @. 
Let @: G + G’ be a group homomorphism, and let H = Ker(¢). Let a € G. Then the 
set 
6 'e@}] = & € G| oa) = ¢@} 


is the left coset aH of H, and is also the right coset Ha of H. Consequently, the two 
partitions of G into left cosets and into right cosets of H are the same. 


Proof We want to show that 


{x €G|¢@) = O@} = aH. 


There is a standard way to show that two sets are equal; show that each is a subset 
of the other. 


Suppose that (x) = ¢(a). Then 
play oa) =e’, 


where e’ is the identity of G’. By Theorem 13.12, we know that (a)! = g(a"), 
so we have 


g(a ")o(x) =e. 
Since @ is a homomorphism, we have 
g(a )o(x) = o(a7'x), so. (a !x) =e’. 


But this shows that a~!x is in H = Ker(#), so a~'x =h for some h € H, and x = 
ah € aH. This shows that 


{x € G| b(x) = o(a)} Call. 


13.16 Example 


13.17 Example 


13.18 Corollary 


Proof 
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To show containment in the other direction, let y € aH, so that y = ah for some 
h € H. Then 


o(y) = b(ah) = d(a)b(h) = G(a)e’ = $a), 


so that y € {x € G|¢(@) = o(a)}. 
We leave the similar demonstration that {x ¢ G|é(x) = ¢(a@)} = Ha to Exercise 
52. 4 


Equation 5 of Section 1 shows that |z1z2| = |z1||z2| for complex numbers z; and Zp. 
This means that the absolute value function | | is a homomorphism of the group C* 
of nonzero complex numbers under multiplication onto the group R* of positive real 
numbers under multiplication. Since {1} is a subgroup of R*, Theorem 13.12 shows 
again that the complex numbers of magnitude 1 forma subgroup U of C*. Recall that the 
complex numbers can be viewed as filling the coordinate plane, and that the magnitude 
of a complex number is its distance from the origin. Consequently, the cosets of U are 
circles with center at the origin. Each circle is collapsed by this homomorphism onto its 
point of intersection with the positive real axis. A 


We give an illustration of Theorem 13.15 from calculus. 


Let D be the additive group of all differentiable functions mapping R into R, and let F 
be the additive group of all functions mapping R into R. Then differentiation gives us a 
map@: D— F, where ¢(f) = f’ for f € F. Weeasily see that ¢ is a homomorphism, 
for o(f +g) =(f +28) = f' +8 = o(f) + O(g); the derivative of a sum is the sum 
of the derivatives. 

Now Ker(@) consists of all functions f such that f’ = 0, the zero constant function. 
Thus Ker(@) consists of all constant functions, which form a subgroup C of F. Let us 
find all functions in G mapped into x? by ¢, that is, all functions whose derivative is 
x2. Now we know that x? /3 is one such function. By Theorem 13.15, all such functions 
form the coset x*/3 + C. Doesn’t this look familiar? A 


We will often use the following corollary of Theorem 13.15. 
A group homomorphism ¢ : G — G’ is a one-to-one map if and only if Ker(¢) = {e}. 


If Ker(d) = {e}, then for every a € G, the elements mapped into ¢(a) are precisely the 
elements of the left coset a{e} = {a}, which shows that @ is one to one. 

Conversely, suppose ¢ is one to one. Now by Theorem 13.12, we know that d(e) = e’, 
the identity element of G’. Since ¢ is one to one, we see that e is the only element mapped 
into e’ by , so Ker(@) = {e}. 5 


In view of Corollary 13.18, we modify the outline given prior to Example 3.8 for 
showing that a map @ is an isomorphism of binary structures when the structures are 
groups G and G’. 
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To Show ¢ : G — G' Is an Isomorphism 


Step 1 Show ¢ is a homomorphism. 
Step 2 Show Ker(¢) = {e}. 
Step 3 Show ¢ maps G onto G’. 


Theorem 13.15 shows that the kernel of a group homomorphism ¢: G > G’ isa 
subgroup H of G whose left and right cosets coincide, so that gH = Hg for all g € G. 
We will see in Section 14 that when left and right cosets coincide, we can form a coset 
group, as discussed intuitively in Section 10. Furthermore, we will see that H then 
appears as the kernel of a homomorphism of G onto this coset group in a very natural 
way. Such subgroups H whose left and right cosets coincide are very useful in studying 
a group, and are given a special name. We will work with them a lot in Section 14. 


@ HistoricaL NoTE 


ormal subgroups were introduced by Evariste 

Galois in 1831 as a tool for deciding whether 
a given polynomial equation was solvable by rad- 
icals. Galois noted that a subgroup H of a group 
G of permutations induced two decompositions of 
G into what we call left cosets and right cosets. 
If the two decompositions coincide, that is, if the 
left cosets are the same as the right cosets, Galois 
called the decomposition proper. Thus a subgroup 
giving a proper decomposition is what we call a 
normal subgroup. Galois stated that if the group 


of permutations of the roots of an equation has a 
proper decomposition, then one can solve the given 
equation if one can first solve an equation corre- 
sponding to the subgroup H and then an equation 
corresponding to the cosets. 

Camille Jordan, in his commentaries on 
Galois’s work in 1865 and 1869, elaborated on these 
ideas considerably. He also defined normal sub- 
groups, although without using the term, essentially 
as on this page and likewise gave the first definition 
of a simple group (page 149). 


————— 


_| 


13.19 Definition A subgroup H of a group G is normal if its left and right cosets coincide, that is, if 


13.20 Corollary 


Proof 


gH = Ag forall g €G. | 


Note that all subgroups of abelian groups are normal. 
If ¢: G > G’ is a group homomorphism, then Ker(@) is a normal subgroup of G. 


This follows immediately from the last sentence in the statement of Theorem 13.15 and 
Definition 13.19. Sa 


For any group homomorphism ¢ : G > G’, two things are of primary importance: 
the kernel of @, and the image ¢[G] of G in G’. We have indicated the importance of 
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Ker(@). Section 14 will indicate the importance of the image @[G]. Exercise 44 asks us 
to show that if |G/ is finite, then |@[G]] is finite and is a divisor of |G]. 


@ EXERCISES 13 


Computations 


In Exercises 1 through 15, determine whether the given map ¢ is a homomorphism. [Hint: The straightforward 
way to proceed is to check whether ¢(ab) = #(a)(b) for all a and b in the domain of ¢. However, if we should 
happen to notice that 67! [{e’}] is not a subgroup whose left and right cosets coincide, or that ¢ does not satisfy the 
properties given in Exercise 44 or 45 for finite groups, then we can say at once that ¢ is not a homomorphism.] 


1. 
. Let @: R > Z under addition be given by 6(x) = the greatest integer < x. 


TAMER YD 


Let @ : Z > R under addition be given by ¢(n) =n. 


. Let @ : R* + R* under multiplication be given by (x) = |x|. 


Let @ : Ze — Zp be given by o(x) = the remainder of x when divided by 2, as in the division algorithm. 
Let ¢ : Zo — Zp» be given by o(x) = the remainder of x when divided by 2, as in the division algorithm. 
Let @: R — R*, where R is additive and R* is multiplicative, be given by (x) = 2*. 


. Let ¢, : Gj > Gi x G2 X ++» x G; x ++- x G, be given by ¢;(g;) = (e1, €2...., Bi, +++, @), Where g; € G; 


and e; is the identity element of G;. This is an injection map. Compare with Example 13.8. 


8. Let G be any group and let ¢ : G > G be given by ¢(g) = g-| forg €G. 


9, Let F be the additive group of functions mapping R into R having derivatives of all orders. Let @ : F > F be 


10. 


11. 
12. 


13. 


14. 


15, 


given by @(f) = 7”, the second derivative of f. 


Let F be the additive group of all continuous functions mapping R into R. Let R be the additive group of real 
numbers, and let dé : F — R be given by 


al 
of) =| F@jdx. 


Let F be the additive group of all functions mapping R into R, and let @: F — F be given by @(f) = 3/f. 


Let M, be the additive group of all n x n matrices with real entries, and let R be the additive group of real 
numbers. Let (A) = det(A), the determinant of A, for A € M,. 

Let M,, and R be as in Exercise 12. Let @(A) = tr(A) for A € M,, where the trace tr(A) is the sum of the 
elements on the main diagonal of A, from the upper-left to the lower-right corner. 

Let GL(n, R) be the multiplicative group of invertible n x n matrices, and let R be the additive group of real 
numbers. Let dé: GL(n, R) — R be given by (A) = tr(A), where tr(A) is defined in Exercise 13. 

Let F be the multiplicative group of all continuous functions mapping R into R that are nonzero at every x € R. 
Let R* be the multiplicative group of nonzero real numbers. Let ¢ : F — R* be given by #(f) = ie F)dx. 


In Exercises 16 through 24, compute the indicated quantities for the given homomorphism ¢. (See Exercise 46.) 


16. 
17. 
18. 


Ker(@) for @ : $3 — Zp. in Example 13.3 
Ker(¢) and @(25) for @ : Z — Z; such that #(1) = 4 
Ker(@) and 6(18) for @ : Z — Zyo such that (1) = 6 
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19, Ker(d) and #(20) for d : Z — Sg such that (1) = (1, 4, 2, 6)(2, 5, 7) 

Ker(¢) and (3) for @ : Zo — Zoo such that d(1) = 8 

Ker(@) and (14) for @ : Zoq > Sg where @(1) = (2, 5)(1, 4, 6, 7) 

Ker(@) and ¢(—3, 2) ford : Z x Z— Z where $(1, 0) = 3 and d(0, 1) = —5 

Ker(@) and #(4, 6) foré : Zx Z => Z x Zwhere o(1, 0) = (2, —3) and #(0, 1) = (-1, 5) 

Ker(@) and #(3, 10) for 6 : Z x Z > Siq where (1, 0) = (3, 5)(2, 4) and (0, 1) = 1, 7)(6, 10, 8, 9) 
How many homomorphisms are there of Z onto Z? 


20. 
21. 
22. 
23. 
24. 
25. 
26. 
27. 
28. 


29. 


Concepts 


How many homomorphisms are there of Z into Z? 


How many homomorphisms are there of Z into Zz? 
Let G be a group, and let g € G. Let dy : G > G be defined by ¢,(x) = gx for x € G. For which g € G is 
oe ahomomorphism? 


Let G be a group, and let g € G. Let @, : G > G be defined by ¢,(x) = gexg”! for x € G. For which g €G 
is ¢g a homomorphism? 


In Exercises 30 and 31, correct the definition of the italicized term without reference to the text, if correction is 
needed, so that it is in a form acceptable for publication. 


30. A homomorphism is a map such that éG@vy) = ()é(y). 


31. 


32. 


Let ¢ : G > G’ be ahomomorphism of groups. The kernel of @ is {x € G| f(x) = e’} where e’ is the identity 


in G’. 


Mark each of the following true or false. 


. A, is anormal subgroup of S,. 

. For any two groups G and G’, there exists a homomorphism of G into G’. 

. Every homomorphism is a one-to-one map. 

. A homomorphism is one to one if and only if the kernel consists of the identity element alone. 

. The image of a group of 6 elements under some homomorphism may have 4 elements. (See Exercise 


44.) 


. The image of a group of 6 elements under a homomorphism may have 12 elements. 

. There is a homomorphism of some group of 6 elements into some group of 12 elements. 

. There is a homomorphism of some groups of 6 elements into some group of 10 elements. 

i. A homomorphism may have an empty kernel. 

j. It is not possible to have a nontrivial homomorphism of some finite group into some infinite group. 


In Exercises 33 through 43, give an example of a nontrivial homomorphism ¢ for the given groups, if an example 
exists. If no such homomorphism exists, explain why that is so. You may use Exercises 44 and 45. 


33. 
35. 
37. 
39, 
41. 
43. 


6 


= 6+ 6 6 6 


Zp > Zs 34. @: 2 > Zs 

: Zo x Ly > Zo x Bs 36.¢:2;-2Z2 
:Z3—> Ss 38. 6: Z— 83 
:ZxZ— 2Z 40.¢:2Z2-2Z2xZ 
: Dg > $3 42. @: S3 > Sa 


: S84 — $3 
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44. 


45. 


46. 


47. 


48. 


49, 


50. 


51. 


§2. 


53. 


54. 


55. 


Let ¢ : G — G’ be a group homomorphism. Show that if |G| is finite, then |¢[G]]| is finite and is a divisor 
of |G. 

Let ¢ : G — G’ be a group homomorphism. Show that if |G’| is finite, then, |@[G]| is finite and is a divisor 
of |G’. 

Let a group G be generated by {a; |i € 7}, where / is some indexing set anda; € Gforalli ¢].Let@:G>G’ 
and jt : G > G’ be two homomorphisms from G into a group G’, such that @(a;) = «(a;) for every € I. Prove 
that @ = uw. [Thus, for example, a homomorphism of a cyclic group is completely determined by its value on a 
generator of the group.] [Hint: Use Theorem 7.6 and, of course, Definition 13.1.] 

Show that any group homomorphism ¢ : G > G’ where |G| is a prime must either be the trivial homomorphism 
or a one-to-one map. 

The sign of an even permutation is +1 and the sign of an odd permutation is —1. Observe that the map 
sgn, : S, — {1, —1} defined by 


sen, (o) = sign of o 


is a homomorphism of S$, onto the multiplicative group {1, —1}. What is the kernel? Compare with Example 
13.3. 


Show that if G,-G’, and G” are groups and if 6 : G ~ G’ and y : G’ > G” are homomorphisms, then the 
composite map yd : G > G” is ahomomorphism. 

Let 6: G > H be a group homomorphism. Show that [G] is abelian if and only if for all x, y € G, we have 
xyx ly! © Ker(d). 

Let G be any group and let a be any element of G. Let @ : Z — G be defined by o(n) = a”. Show that ¢ is a 
homomorphism. Describe the image and the possibilities for the kernel of ¢. 

Let @ : G — G’ be a homomorphism with kernel H and let a € G. Prove the set equality (x «€ G{ d(x) = 
o(a)} = Ha. 

Let G be a group, Let h,k € G andlet@: Zx Z— G be defined by (m,n) = hk”. Give a necessary and 
sufficient condition, involving h and k, for ¢ to be a homomorphism. Prove your condition. 


Find a necessary and sufficient condition on G such that the map ¢ described in the preceding exercise is a 
homomorphism for all choices of h, k € G. 


Let G be a group, h an element of G, and n a positive integer. Let ¢ : Z, > G be defined by $@) = A’ for 
0 <i <n. Give a necessary and sufficient condition (in terms of h and n) for ¢ to be a homomorphism. Prove 
your assertion. 


Factor Groups 


Let H beasubgroup of a finite group G. Suppose we write a table for the group operation 
of G, listing element heads at the top and at the left as they occur in the left cosets of 
H.. We illustrated this in Section 10. The body of the table may break up into blocks 
corresponding to the cosets (Table 10.5), giving a group operation on the cosets, or they 
may not break up that way (Table 10.9). We start this section by showing that if 7 is the 
kernel of a group homomorphism ¢ : G > G’', then the cosets of H (remember that left 
and right cosets then coincide) are indeed elements of a group whose binary operation 
is derived from the group operation of G. 
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Homomorphisms and Factor Groups 


Factor Groups from Homomorphisms 


Let G be a group and let S be a set having the same cardinality as G. Then there is a one- 
to-one correspondence < between S and G. We can use <> to define a binary operation 
on S$, making S into a group isomorphic to G. Naively, we simply use the correspondence 
to rename each element of G by the name of its corresponding (under <>) element in S. 
We can describe explicitly the computation of xy for x, y € S as follows: 


ifx<og, and you. and ~ogig2z, then xy =z. (1) 


The direction —> of the one-to-one correspondence s <> g between se S and g€G 
gives us a one-to-one function , mapping S onto G. (Of course, the direction <— of <> 
gives us the inverse function .~'). Expressed in terms of jz, the computation (1) of xy 
for x, y € S becomes 


ifu(x)=2, and wly)=g. and u(z)=gigo, then xy=z. (2) 


The map 2: S > G now becomes an isomorphism mapping the group S$ onto the 
group G. Notice that from (2), we obtain wwy) = u(z) = g182 = u(x) u(y), the required 
homomorphism property. 

Let G and G’ be groups, let @ : G — G’ be a homomorphism, and let H = Ker(@). 
Theorem 13.15 shows that for a € G, we have ¢7|[{#(a)}] = aH = Ha. We have a 
one-to-one correspondence aH <> (a) between cosets of H in G and elements of the 
subgroup ¢[G] of G’. Remember that if x € aH, so that x = ah for some h € H, then 
$(x) = o(ah) = d(a)o(h) = o(@e’ = (a), so the computation of the element of ¢[G] 
corresponding to the coset aH = xH is the same whether we compute it as ¢(@) or as 
$(x). Let us denote the set of all cosets of H by G/H. (We read G/H as “G over H” or 
as “G modulo H” or as “G mod H,” but never as “G divided by H.”) 

In the preceding paragraph, we started with a homomorphism ¢ : G > G’ having 
kernel H, and we finished with the set G/H of cosets in one-to-one correspondence with 
the elements of the group #[G]. In our work above that, we had a set S with elements 
in one-to-one correspondence with a those of a group G, and we made S$ into a group 
isomorphic to G with an isomorphism 2. Replacing S by G/H and replacing G by 
¢[G] in that construction, we can consider G/H to be a group isomorphic to ¢[G] with 
that isomorphism jz. In terms of G/H and $[G], the computation (2) of the product 
(xH)(yH) forxH, yH € G/H becomes 


if u(x) = (x) and pw(yH)=¢(y) and pu(zH) = (x)b(y), 
then (xA)(yH) = 2H. (3) 
But because @ is a homomorphism, we can easily find z € G such that u(zH) = 
o(x)o(y); namely, we take z = xy in G, and find that 
H(ZH) = way) = oy) = 6) O(y). 


This shows that the product (x H)(y H) of two cosets is the coset (xy)H that contains 
the product xy of x and y in G. While this computation of (x H)(yH) may seem to 
depend on our choices x from x H and y from yH, our work above shows it does not. 
We demonstrate it again here because it is such an important point. If hy. h2 € A so that 
xh, is an element of xH and yh is an element of yH, then there exists hz ¢ H such 


14.1 Theorem 


14.2 Example 


14.3 Example 
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that h, y = yh3 because Hy = yH by Theorem 13.15. Thus we have 
(hy)h2) = xChiy)ho = x(yha)ho = (xy) (ish) € oy) A, 


so we obtain the same coset. Computation of the product of two cosets is accomplished 
by choosing an element from each coset and taking, as product of the cosets, the coset 
that contains the product in G of the choices. Any time we define something (like a 
product) in terms of choices, it is important to show that it is well defined, which means 
that it is independent of the choices made. This is precisely what we have just done. We 
summarize this work in a theorem. 


Let ¢ : G — G' be a group homomorphism with kernel H. Then the cosets of H form 
a factor group, G/H, where (aH)(bH) = (ab)H. Also, the map uw: G/H > @[G] 
defined by (aH) = @(a) is an isomorphism. Both coset multiplication and jz are well 
defined, independent of the choices a and b from the cosets. 


Example 13.10 considered the map y : Z > Z,, where y(m) is the remainder when 
m is divided by n in accordance with the division algorithm. We know that y is a 
homomorphism. Of course, Ker(y) = Z. By Theorem 14.1, we see that the factor 
group Z/nZ is isomorphic to Z,. The cosets of nZ are the residue classes modulo n. For 
example, taking n = 5, we see the cosets of SZ are 


§Z = {-:-,-10, —5, 0,5, 10, ---}, 
PSF See 41-6, 1d 38); 
P50 = P28, 8210), 
RAST he 8 Ie 
44+5Z ={---,-6,-1,4,9, 14,--+} 


Note that the isomorphism yu : Z/5Z — Zs of Theorem 14.1 assigns to each coset of 
5Z its smallest nonnegative element. That is, 4(5Z) = 0, (1 + 5Z) = 1, etc. A 


It is very important that we learn how to compute in a factor group. We can multiply 
(add) two cosets by choosing any two representative elements, multiplying (adding) 
them and finding the coset in which the resulting product (sum) lies. 


Consider the factor group Z/5Z with the cosets shown above. We can add (2+ 5Z) + 
(4+5Z) by choosing 2 and 4, finding 2 + 4 = 6, and noticing that 6 is in the coset 
1 +5Z. We could equally well add these two cosets by choosing 27 in2 + 5Z and —16 
in 4+ 5Z; the sum 27 + (—16) = 11 is also in the coset 1 + 5Z. A 


The factor groups Z/nZ in the preceding example are classics. Recall that we refer 
to the cosets of nZ as residue classes modulo n. Two integers in the same coset are 
congruent modulo n. This terminology is carried over to other factor groups. A factor 
group G/# is often called the factor group of G modulo H. Elements in the same 
coset of H are often said to be congruent modulo H. By abuse of notation, we may 
sometimes write Z/nZ = Z,, and think of Z,, as the additive group of residue classes of 
Z modulo (n), or abusing notation further, modulo n. 
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14.4 Theorem 


Proof 


14.5 Corollary 


Homomorphisms and Factor Groups 


Factor Groups from Normal Subgroups 


So far, we have obtained factor groups only from homomorphisms. Let G be a group and 
let H be a subgroup of G. Now H has both left cosets and right cosets, and in general, 
a left coset aH need not be the same set as the right coset Ha. Suppose we try to define 
a binary operation on left cosets by defining 


(aH)(bH) = (ab)H (4) 


as in the statement of Theorem 14.1 Equation 4 attempts to define left coset multiplication 
by choosing representatives a and b from the cosets. Equation 4 is meaningless unless 
it gives a well-defined operation, independent of the representative elements a and b 
chosen from the cosets. The theorem that follows shows that Eq. 4 gives a well-defined 
binary operation if and only if H is a normal subgroup of G. 


Let H be a subgroup of a group G. Then left coset multiplication is well defined by the 
equation 


(aH)(bH) = (ab)H 
if and only if H is a normal subgroup of G. 


Suppose first that (@H)(bH) = (ab)H does give a well-defined binary operation on left 
cosets. Let a € G. We want to show that aH and Ha are the same set. We use the 
standard technique of showing that each is a subset of the other. 

Let x € aH. Choosing representatives x ¢aH and a-!¢a!H, we have 
(xH)(a-!H) = (xa7!)H. On the other hand, choosing representatives a € aH and 
a7! € aH, we see that (aH)(a7!H) = eH = H. Using our assumption that left coset 
multiplication by representatives is well defined, we must have xa~! = h € H. Then 
x = ha, so x € Ha and aH C Ha. We leave the symmetric proof that Ha C aH to 
Exercise 25. 

We turn now to the converse: If H is anormal subgroup, then left coset multiplication 
by representatives is well-defined. Due to our hypothesis, we can simply say cosets, 
omitting left and right. Suppose we wish to compute (a@H)(bH). Choosing a € aH and 
b € bH, we obtain the coset (ab)H. Choosing different representatives ah; ¢ aH and 
bh2 € bH, we obtain the coset ah1bh2H. We must show that these are the same coset. 
Now A1b € Hb = bH, so hb = bh; for some h3 € H. Thus 


(ah,)(6h2) = ahi b)h2 = a(bha)hz = (ab)(h3h2) 
and (ab)(h3h2) € (ab)H. Therefore, ah bh is in (ab)H. « 


Theorem 14.4 shows that if left and right cosets of H coincide, then Eq. 4 gives a 
well-defined binary operation on cosets. We wonder whether the cosets do form a group 
with such coset multiplication. This is indeed true. 


Let H be anormal subgroup of G. Then the cosets of H form a group G/H under the 
binary operation (@H)(bH) = (ab)H. A 


Proof 


14.6 Definition 


14.7 Example 


14.8 Example 


14.9 Theorem 


Proof 
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Computing, (aH)[(bH)\(cH)] = (aH)[(bc)H] = [a(bc)|H, and similarly, we have 
[((aH)(bH)|(cH) = [(ab)c]H, so associativity in G/H follows from associativity in 
G. Because (aH )\(eH) = (ae)H = aH = (ea)H = (eH)\(aH), we see that eH = H is 
the identity element in G/H. Finally, (a7! H)\(aH) = (aa)H =eH = (aa~')H = 
(aH)\(a7'H) shows that a7'H = (aH)"!. o 


The group G/H in the preceding corollary is the factor group (or quotient group) of 
G by H. a 


Since Z is an abelian group, nZ is a normal subgroup. Corollary 14.5 allows us to 
construct the factor group Z/nZ with no reference to a homomorphism. As we observed 
in Example 14.2, Z/nZ is isomorphic to Z,. A 


Consider the abelian group R under addition, and let c € R~. The cyclic subgroup (c) 
of R contains as elements 


«++ = 3e, —2c, -—c, 0, €, 2c, 3c, ++>. 


Every coset of (c) contains just one element of x such that 0 < x < c. If we choose these 
elements as representatives of the cosets when computing in R/(c), we find that we are 
computing their sum modulo ¢ as discussed for the computation in R, in Section 1. 
For example, if c = 5.37, then the sum of the cosets 4.65 + (5.37) and 3.42 + (5.37) 
is the coset 8.07 + (5.37), which contains 8.07 — 5.37 = 2.7, which is 4.65 +5 37 3.42. 
Working with these coset elements x where 0 < x < c, we thus see that the group R, of 
Example 4.2 is isomorphic to R/(c) under an isomorphism w where W(x) = x + (c) for 
all x € R,. Of course, R/(c} is then also isomorphic to the circle group U of complex 
numbers of magnitude 1 under multiplication. A 


We have seen that the group Z/(n) is isomorphic to the group Z,, and as a set, 
Z, = {0,1,3,4,---, — 1}, the set of nonnegative integers less than n. Example 14.8 
shows that the group R/{c) is isomorphic to the group R.. In Section 1, we choose the 
notation R,. rather than the conventional [0, c) for the half-open interval of nonnegative 
real numbers less than c. We did that to bring out now the comparison of these factor 
groups of Z with these factor groups of R. 


The Fundamental Homomorphism Theorem 


We have seen that every homomorphism ¢ : G — G’ gives rise to a natural factor group 
(Theorem 14.1), namely, G/Ker(@). We now show that each factor group G/H gives rise 
to a natural homomorphism having H as kernel. 


Let H be a normal subgroup of G. Then y : G > G/H given by y(x) =xH isa 
homomorphism with kernel 7. 
Let x, y € G. Then 


yxy) = (xy) A = (AyD) = y@)y), 
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14.11 Theorem 


14.12 Example 


Solution 


Homomorphisms and Factor Groups 


G/H 


14.10 Figure 


so y isa homomorphism. Since x7 = H if an only if x € H, we see that the kernel of 
y is indeed H. o 


We have seen in Theorem 14.1 thatif@ : G > G’isahomomorphism withkernel H, 
then u : G/H — $[G] where (gH) = 9(g) is an isomorphism. Theorem 14.9 shows 
that y : G > G/H defined by y(g) = gH isa homomorphism. Figure 14.10 shows 
these groups and maps. We see that the homomorphism @¢ can be factored, 6 = jy, 
where y is a homomorphism and jz is an isomorphism of G/H with ¢[G]. We state this 
as a theorem. 


(The Fundamental Homomorphism Theorem) Let ¢ : G — G’ be a group homo- 
morphism with kernel H. Then $[G] is a group, and 4: G/H — ¢[G] given by 
u(gH) = (g) is an isomorphism. If y : G > G/H is the homomorphism given by 
v(g) = gH, then $(g) = wy(g) for each g € G. 


The isomorphism jz in Theorem 14.11 is referred to as a natural or canonical 
isomorphism, and the same adjectives are used to describe the homomorphism y. There 
may be other isomorphisms and homomorphisms for these same groups, but the maps 
and y have a special status with @ and are uniquely determined by Theorem 14.11. 

In summary, every homomorphism with domain G gives rise to a factor group G/F, 
and every factor group G/H gives rise to a homomorphism mapping G into G/H. 
Homomorphisms and factor groups are closely related. We give an example indicating 
how useful this relationship can be. 


Classify the group (Zs x Z2)/({0} x Za) according to the fundamental theorem of finitely 
generated abelian groups (Theorem 11.12). 


The projection map 7 : Z4 x Zz > Z4 given by m (x,y) =x isa homomorphism of 
Za x Zo onto Zq with kernel {0} x Z2. By Theorem 14.11, we know that the given factor 
group is isomorphic to Z4. A 


Normal Subgroups and Inner Automorphisms 


We derive some alternative characterizations of normal subgroups, which often provide 
us with an easier way to check normality than finding both the left and the right coset 
decompositions. 


14.13 Theorem 


14.14 Example 


14.15 Definition 
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Suppose that H is a subgroup of G such that ghg! € H for all g ¢ G and all 
h © H. Then gHge-! = {ghe'|h € H} CH for all g € G. We claim that actually 
gHg-' = H. We must show that H © gHg™! forall g € G. Leth € H. Replacing g by 
g—! in the relation ghg—! € H, we obtain g—h(g—!)—! = g-'hg = Ay where hy € H. 
Consequently, k = ghig7' € gHg7', and we are done. 

Suppose that gH = Hg forall g € G. Then gh = hig,s0 ghg! € H forallg € G 
and all hk < H. By the preceding paragraph, this means that gHg-! = H forall g € G. 
Conversely, if gHg-! = H for all g € G, then ghg"! = h, so gh = hig € Hg, and 
gH © Hg. But also, g-'Hg =H giving g—'hg = ho, so that hg = ghy and Hg € 
gH. 

We summarize our work as a theorem. 


The following are three equivalent conditions for a subgroup H of a group G to be a 
normal subgroup of G. 


1. ghg-'e A forallg € Gandhe H. 
2. gHg' =H forallg €G. 
3. gH = Hg forallg eG. 


Condition (2) of Theorem 14.13 is often taken as the definition of a normal subgroup 
H of a group G. 


Every subgroup A of an abelian group G is normal. We need only note that gh = hg 
for allh € H andall g € G, so, of course, ghg} =heH forallg eGandallhe dH. 
A 


Exercise 29 of Section 13 shows that the mapi, : G > G defined by i,(x) = gxg™! 


is ahomomorphism of G into itself. We see that gag—! = gbg™! if and only if a = 6, so 
i, isone to one. Since g(g~!yg)g—! = y, we see that i, is onto G, so itis an isomorphism 
of G with itself. 


An isomorphism ¢ : G — G of a group G with itself is an automorphism of G. The 
automorphism i, : G — G, where i,(x) = exg! for all x € G, is the inner automor- 
phism of G by g. Performing i, on x is called conjugation of x by g. | 


The equivalence of conditions (1) and (2) in Theorem 14.13 shows that gH = Hg 
for all g € Gifand only ifi,{H] = H forall g € G, thatis, if and only if H is invariant 
under all inner automorphisms of G. It is important to realize that ig[H] = H is an 
equation in sets; we need not have ip(h) = h for all h € H. That is ig may perform a 
nontrivial permutation of the set H. We see that the normal subgroups of a group G are 
precisely those that are invariant under all inner automorphisms. A subgroup K of G is 
a conjugate subgroup of H if K =i,[H] for some g € G. 
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Computations 
In Exercises 1 through 8, find the order of the given factor group. 


1, Z6/{3) 2. (Z4 x Zy2)/({2) x {2)) 

3. (Ze x Zr)/((2, 1) 4, (Z3 x Z5)/({0} x Zs) 

5. (Zo x Z4)/{(1, 1)) 6. (Zi2 x Zig)/((4, 3)) 

7. (Zy x S3)/{C, p1)) 8. (Zy) x Zy5)/((A, D) 
In Exercises 9 through 15, give the order of the element in the factor group. 

9. 5+ (4) in Z,2/(4) 10. 26 + (12) in Zeo/(12) 
11. (2,1) + (1, 1)) in 3 x Ze)/(CL, 1)) 12. (3, 1) + (C, 1)) in (Zy x Za)/(C1, 1)) 
13. (3, 1) + (C0, 2)) in (Zq x Zg)/{(O, 2)} 14. (3,3) + (C1, 2)) in (Zq x Zg)/((1, 2)) 


15. (2,0) + (4, 4)) in Ze x Zg)/((4, 4) 
16. Compute i,,[H] for the subgroup H = {0, 1} of the group 53 of Example 8.7. 


Concepts 

In Exercises 17 through 19, correct the definition of the italicized term without reference to the text, if correction 
is needed, so that it is in a form acceptable for publication. 

17. A normal subgroup H of G is one satisfying hG = Gh forallh € H. 

18. A normal subgroup H of G is one satisfying g~'hg € H for allA € H andall ge G. 

19. An automorphism of a group G is a homomorphism mapping G into G. 

20. What is the importance of a normal subgroup of a group G? 


Students often write nonsense when first proving theorems about factor groups. The next two exercises are designed 
to call attention to one basic type of error. 


21. A student is asked to show that if H is a normal subgroup of an abelian group G, then G/H is abelian. The 
student’s proof starts as follows: 
We must show that G/H is abelian. Let a and b be two elements of G/H. 
a. Why does the instructor reading this proof expect to find nonsense from here on in the student’s paper? 
b. What should the student have written? 
c. Complete the proof. 
22. A torsion group is a group all of whose elements have finite order. A group is torsion free if the identity is 


the only element of finite order. A student is asked to prove that if G is a torsion group, then so is G/H for 
every normal subgroup H of G. The student writes 


We must show that each element of G/H is of finite order. Let x € G/H. 
Answer the same questions as in Exercise 21. 


23. Mark each of the following true or false. 


a. It makes sense to speak of the factor group G/N if and only if N is a normal subgroup of the group 
G. 


b. Every subgroup of an abelian group G is a normal subgroup of G. 
c. An inner automorphism of an abelian group must be just the identity map. 
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d. Every factor group of a finite group is again of finite order. 

e. Every factor group of a torsion group is a torsion group. (See Exercise 22.) 
f. Every factor group of a torsion-free group is torsion free. (See Exercise 22.) 
g. Every factor group of an abelian group is abelian. 


h. Every factor group of a nonabelian group is nonabelian. 


i. Z/nZ is cyclic of order n. 
j. R/nR is cyclic of order n, where nR = {nr |r € R} and R is under addition. 


Theory 


24. 


25. 


26. 


27. 


28. 


29. 
30. 
31. 
32. 


33. 


34. 
35. 


36. 


37 


. 


38. 


39. 


Show that A, is a normal subgroup of S,, and compute S;,,/A,; that is, find a known group to which S,/Aj, is 
isomorphic. 

Complete the proof of Theorem 14.4 by showing that if H is a subgroup of a group G and if left coset 
multiplication (@H)(bH) = (ab)H is well defined, then Ha C aH. 

Prove that the torsion subgroup T of an abelian group G is a normal subgroup of G, and that G/T is torsion 
free. (See Exercise 22.) 

A subgroup H is conjugate to a subgroup K of a group G if there exists an inner automorphism i, of G such 
that i,[H] = K. Show that conjugacy is an equivalence relation on the collection of subgroups of G. 


Characterize the normal subgroups of a group G in terms of the cells where they appear in the partition given 
by the conjugacy relation in the preceding exercise. 


Referring to Exercise 27, find all subgroups of 53 (Example 8.7) that are conjugate to {o, L2}. 
Let H be a normal subgroup of a group G, and let m = (G : H). Show that a” € H foreverya € G. 
Show that an intersection of normal subgroups of a group G is again a normal subgroup of G. 


Given any subset S of a group G, show that it makes sense to speak of the smallest normal subgroup that 
contains §. [Hint: Use Exercise 31.] 


Let G be a group. An element of G that can be expressed in the form aba'b-' for some a,b € G isa 
commutator in G. The preceding exercise shows that there is a smallest normal subgroup C of a group G 
containing all commutators in G; the subgroup C is the commutator subgroup of G. Show that G/C is an 
abelian group. 


Show that if a finite group G has exactly one subgroup # of a given order, then H is anormal subgroup of G. 


Show that if H and N are subgroups of a group G, and N is normal in G, then HM N is normal in H. Show 
by an example that H 1 N need not be normal in G. 


Let G be a group containing at least one subgroup of a fixed finite order s. Show that the intersection of all 
subgroups of G of order s is a normal subgroup of G. [Hint: Use the fact that if H has order s, then so does 
x7! Hx for allx € GJ 


a. Show that all automorphisms of a group G form a group under function composition. 


b. Show that the inner automorphisms of a group G form a normal subgroup of the group of all automorphisms 
of G under function composition. [Warning: Be sure to show that the inner automorphisms do form a 
subgroup. ] 

Show that the set of all g € G such thati, : G > G is the identity inner automorphism i, is a normal subgroup 

of a group G. 


Let G and G’ be groups, and let H and H’ be normal subgroups of G and G’, respectively. Let @ be a 
homomorphism of G into G’. Show that ¢ induces a natural homomorphism @, : (G/H) > (G'/A’)if LA] C 
H’. (This fact is used constantly in algebraic topology.) 
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Part IIIT Homomorphisms and Factor Groups 


Use the properties det(AB) = det(A) - det(B) and det(Z,) = 1 for n x n matrices to show the following: 


a. The n x n matrices with determinant 1 form a normal subgroup of GL(n, R). 
b. The n x n matrices with determinant +1 form a normal subgroup of GL(m, R). 


Let G be a group, and let H(G) be the set of all subsets of G. For any A, B € A(G), let us define the product 
subset AB = {ab|a € A, be B}. 


a. Show that this multiplication of subsets is associative and has an identity element, but that /(G) is not a 
group under this operation. 

b. Show that if N is a normal subgroup of G, then the set of cosets of N is closed under the above operation 
on A(G), and that this operation agrees with the multiplication given by the formula in Corollary 14.5. 

c. Show (without using Corollary 14.5) that the cosets of N in G form a group under the above operation. Is 
its identity element the same as the identity element of A(G)? 


Factor-Grovur COMPUTATIONS AND SIMPLE GROUPS 


Factor groups can be a tough topic for students to grasp. There is nothing like a bit of com- 
putation to strengthen understanding in mathematics. We start by attempting to improve 
our intuition concerning factor groups. Since we will be dealing with normal subgroups 
throughout this section, we often denote a subgroup of a group G by N rather than by H. 

Let N be anormal subgroup of G. In the factor group G/N, the subgroup N acts as 
identity element. We may regard N as being collapsed to a single element, either to 0 in 
additive notation or to ¢ in multiplicative notation. This collapsing of N together with 
the algebraic structure of G require that other subsets of G, namely, the cosets of N, 
also collapse into a single element in the factor group. A visualization of this collapsing 
is provided by Fig. 15.1. Recall from Theorem 14.9 that y : G > G/N defined by 
y(a) = aN fora € Gisahomomorphism of G onto G/N. Figure 15.1 is very similar to 
Fig. 13.14, but in Fig. 15.1 the image group under the homomorphism is actually formed 
from G. We can view the “line” G/N at the bottom of the figure as obtained by collapsing 
to a point each coset of N in another copy of G. Each point of G/N thus corresponds 
to a whole vertical line segment in the shaded portion, representing a coset of N in 
G. It is crucial to remember that multiplication of cosets in G/N can be computed by 
multiplying in G, using any representative elements of the cosets as shown in the figure. 


| | | | 
i | | | 
| | | | 
| | | | 
di 


t —— i a oo 
aN N bN (cN)(bN) (ab)N 
=(ch)N = (aN)(bN) 


15.1 Figure 


15.2 Example 


Solution 


15.3 Example 


Solution 


15.4 Example 


15.5 Table 
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Additively, two elements of G will collapse into the same element of G/N if they 
differ by an element of N. Multiplicatively, a and b collapse together if ab"! is in N. 
The degree of collapsing can vary from nonexistent to catastrophic. We illustrate the two 
extreme cases by examples. 


The trivial subgroup N = {0} of Z is, of course, a normal subgroup. Compute Z/{O}. 


Since N = {0} has only one element, every coset of N has only one element. That is, 
the cosets are of the form {m} for m € Z. There is no collapsing at all, and consequently, 
Z/{O} ~ Z. Each m € Z is simply renamed {m} in Z/{O}. A 


Let n be a positive integer. The setnR = {nr |r € R} is a subgroup of R under addition, 
and it is normal since R is abelian. Compute R/nR. 


A bit of thought shows that actually n7R = R, because each x € R is of the form n(x/n) 
and x/n ¢ IR. Thus R/nR has only one element, the subgroup nR. The factor group is 
a trivial group consisting only of the identity element. A 


As illustrated in Examples 15.2 and 15.3 for any group G, we have G/{e} ~ G 
and G/G ~ {e}, where {e} is the trivial group consisting only of the identity element e. 
These two extremes of factor groups are of little importance. We would like knowledge 
of a factor group G/N to give some information about the structure of G. If N = {e}, 
the factor group has the same structure as G and we might as well have tried to study G 
directly. If N = G, the factor group has no significant structure to supply information 
about G. If G is a finite group and N # {e} is a normal subgroup of G, then G/N isa 
smaller group than G, and consequently may have a more simple structure than G. The 
multiplication of cosets in G/N reflects the multiplication in G, since products of cosets 
can be computed by multiplying in G representative elements of the cosets. 

We give two examples showing that even when G/N has order 2, we may be able to 
deduce some useful results. If G is a finite group and G/N has just two elements, then 
we must have |G{ = 2|N|. Note that every subgroup A containing just half the elements 
of a finite group G must be a normal subgroup, since for each element a in G but not in 
H, both the left coset aH and the right coset Ha must consist of all elements in G that 
are not in H. Thus the left and right cosets of H coincide and H is a normal subgroup 
of G. 


Because |.S,,| = 2|A,|, we see that A, is anormal subgroup of S,,, and S,,/A, has order 2. 
Let co be an odd permutation in S,, so that S,,/A, = {An, oA,}. Renaming the element 
A, “even” and the element o A, “odd,” the multiplication in S,/A, shown in Table 15.5 
becomes 


(even)(even) = even (odd){even) = odd 
(even)(odd) = odd (odd)(odd) = even. 


Thus the factor group reflects these multiplicative properties for all the permutations in 
Sn. A 
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15.6 Example 


15.7 Example 


Homomorphisms and Factor Groups 


Example 15.4 illustrates that while knowing the product of two cosets in G/N does 
not tell us what the product of two elements of G is, it may tell us that the product in G 
of two types of elements is itself of a certain type. 


(Falsity of the Converse of the Theorem of Lagrange) The theorem of Lagrange 
states if H is a subgroup of a finite group G, then the order of H divides the order of G. 
We show that it is false that if d divides the order of G, then there must exist a subgroup 
H of G having order d. Namely, we show that Ay, which has order 12, contains no 
subgroup of order 6. 

Suppose that H were a subgroup of Aq having order 6. As observed before in 
Example 15.4, it would follow that H would be a normal subgroup of Aq. Then Aq/H 
would have only two elements, H and o H for some o € A, not in H. Since in a group 
of order 2, the square of each element is the identity, we would have HH = HA and 
(o H)(o H) = H. Now computation in a factor group can be achieved by computing 
with representatives in the original group. Thus, computing in Ag, we find that for each 
a € H we must have a? € H and for each 8 € oH we must have B° € H. That is, the 
square of every element in Aq must be in H. But in Ay, we have 


(1,253) = 53, 2% and, <)3, 2) = 1,237 


so (1, 2, 3) and (1, 3, 2) are in H. A similar computation shows that (1, 2, 4), (1, 4, 2), 
(1, 3, 4), C1, 4, 3), (2, 3, 4), and (2, 4, 3) are all in H. This shows that there must be at 
least 8 elements in H, contradicting the fact that H was supposed to have order6.  & 


We now turn to several examples that compute factor groups. If the group we start 
with is finitely generated and abelian, then its factor group will be also. Computing sucha 
factor group means classifying it according to the fundamental theorem (Theorem 11.12). 


Let us compute the factor group (Za x Zs)/((0, 1)). Here ((0, 1)) is the cyclic subgroup 
H of Zy x Ze generated by (0, 1). Thus 


H = {(0. 0), (0, 1), O. 2), (0, 3), (0, 4), ©, 5)}. 


Since Z4 x Ze has 24 elements and H has 6 elements, all cosets of H must have 
6 elements, and (Z4 x Zs)/H must have order 4. Since Zy x Ze is abelian, so is 
(Za x Ze)/H (remember, we compute in a factor group by means of representatives 
from the original group). In additive notation, the cosets are 


H =(0,0)+ A, (1,0) + #H, (2,0) + H, (3,0) + H. 


Since we can compute by choosing the representatives (0, 0), (1, 9), (2, 0), and (3, 0), it 
is clear that (Z4 x Ze)/H is isomorphic to Z4. Note that this is what we would expect, 
since in a factor group modulo H, everything in H becomes the identity element; that is, 
we are essentially setting everything in H equal to zero. Thus the whole second factor 
Ze of Za x Ze is collapsed, leaving just the first factor Zy. A 


Example 15.7 is a special case of a general theorem that we now state and prove. 
We should acquire an intuitive feeling for this theorem in terms of collapsing one of the 
factors to the identity element. 


15.8 Theorem 


Proof 


15.9 Theorem 


Proof 


15.10 Example 


15.11 Example 
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Let G = H x K be the direct product of groups H and K. Then H = {th,e)|he H} 
is a normal subgroup of G. Also G/H is isomorphic to K in a natural way. Similarly, 
G/K = d ina natural way. 


Consider the homomorphism 22: H x K — K, where 72(h,k) =k. (See Example 
13.8). Because Ker(z) = H, we see that H isa normal subgroup of H x K. Because 
mz is onto K, Theorem 14.11 tells us that (H x K)/H =~ K. 5 


We continue with additional computations of abelian factor groups. To illustrate 
how easy it is to compute in a factor group if we can compute in the whole group, we 
prove the following theorem. 


A factor group of a cyclic group is cyclic. 


Let G be cyclic with generator a, and let N be a normal subgroup of G. We claim 
the coset aN generates G/N. We must compute all powers of aN. But this amounts to 
computing, in G, all powers of the representative a and all these powers give all elements 
in G. Hence the powers of aN certainly give all cosets of N and G/N is cyclic. Sd 


Let us compute the factor group (Z4 x Ze)/((0, 2)). Now (0, 2) generates the subgroup 
H = {(0, 0), (0, 2), (0, 4)} 


of Zs x Zs of order 3. Here the first factor Z4 of Z4 x Ze is left alone. The Ze factor, 
on the other hand, is essentially collapsed by a subgroup of order 3, giving a factor group 
in the second factor of order 2 that must be isomorphic to Z2. Thus (Z4 x Ze)/((O, 2)) 
is isomorphic to Z4 x Zp. A 


Let us compute the factor group (Z4 x Ze)/((2, 3)). Be careful! There is a great temp- 
tation to say that we are setting the 2 of Z, and the 3 of Ze, both equal to zero, so that 
Za is collapsed to a factor group isomorphic to Z and Z¢ to one isomorphic to Zs, giving 
a total factor group isomorphic to Z x Z3. This is wrong! Note that 


H = (2, 3)) = {(, 9), 2, 3)} 


is of order 2, so (Z4 x Ze)/((2. 3)) has order 12, not 6. Setting (2, 3) equal to zero 
does not make (2, 0) and (0, 3) equal to zero individually, so the factors do not collapse 
separately. 

The possible abelian groups of order 12 are Z4 x Z3 and Z) x Z) x Zs, and we 
must decide to which one our factor group is isomorphic. These two groups are most 
easily distinguished in that Z, x Z3 has an element of order 4, and Z, x Zz x Z3 does 
not. We claim that the coset (1, 0)+ H is of order 4 in the factor group (Z4 x Z,)/H. 
To find the smallest power of a coset giving the identity in a factor group modulo H, we 
must, by choosing representatives, find the smallest power of a representative that is in 
the subgroup H. Now, 


4(1, 0) = (1, 0) + (1, 0) + C1, 0) + 1, 0) = (0, 0) 


is the first time that (1, 0) added to itself gives an element of H. Thus (Zg x Zg)/((2, 3)) 
has an element of order 4 and is isomorphic to Z4 x Z3 or Zyp. A 
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Part HI 


15.12 Example 


Homomorphisms and Factor Groups 


Letus compute (that is, classify as in Theorem 11.12 the group (Z x Z)/((1, 1)). We may 
visualize Z x Z as the points in the plane with both coordinates integers, as indicated 
by the dots in Fig. 15.13. The subgroup ((1, 1)) consists of those points that lie on the 
45° line through the origin, indicated in the figure. The coset (1, 0) + (C1, 1)) consists of 
those dots on the 45° line through the point (1, 0), also shown in the figure. Continuing, 
we see that each coset consists of those dots lying on one of the 45° lines in the figure. 
We may choose the representatives 


Seer (-3, 0), (-2, 0), (-1, 0), (0, 0), d, 0), Q2, 0), G, 0), coe 


of these cosets to compute in the factor group. Since these representatives correspond 
precisely to the points of Z on the x-axis, we see that the factor group (Z x Z)/((1, 1)) 
is isomorphic to Z. A 


Y 


15.13 Figure 


Simple Groups 


As we mentioned in the preceding section, one feature of a factor group is that it gives 
crude information about the structure of the whole group. Of course, sometimes there 
may be no nontrivial proper normal subgroups. For example, Theorem 10.10 shows that 
a group of prime order can have no nontrivial proper subgroups of any sort. 


15.14 Definition 


15.15 Theorem 


Proof 


15.16 Theorem 


15.17 Definition 
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A group is simple if it is nontrivial and has no proper nontrivial normal subgroups. 


The alternating group A, is simple for n = 5. 


See Exercise 39. tg 


There are many simple groups other than those given above. For example, As is of 
order 60 and Ag is of order 360, and there is a simple group of nonprime order, namely 
168, between these orders. 

The complete determination and classification of all finite simple groups were re- 
cently completed. Hundreds of mathematicians worked on this task from 1950 to 1980. 
It can be shown that a finite group has a sort of factorization into simple groups, where 
the factors are unique up to order. The situation is similar to the factorization of positive 
integers into primes. The new knowledge of all finite simple groups can now be used to 
solve some problems of finite group theory. 

We have seen in this text that a finite simple abelian group is isomorphic to Z, for 
some prime p. In 1963, Thompson and Feit [21] published their proof of a longstanding 
conjecture of Burnside, showing that every finite nonabelian simple group is of even 
order. Further great strides toward the complete classification were made by Aschbacher 
in the 1970s. Early in 1980, Griess announced that he had constructed a predicted 
“monster” simple group of order 


808, 017, 424, 794, 512, 875, 886, 459, 904, 961, 710, 757, 005, 754, 368, 
000, 000, 000. 


Aschbacher added the final details of the classification in August 1980. The research 
papers contributing to the entire classification fill roughly 5000 journal pages. 

We turn to the characterization of those normal subgroups N of a group G for which 
G/N is a simple group. First we state an addendum to Theorem 13.12 on properties of 
a group homomorphism. The proof is left to Exercises 35 and 36. 


Let 6 : G > G’ bea group homomorphism. If N is anormal subgroup of G, then @[N] 
is anormal subgroup of @[G]. Also, if N’ is a normal subgroup of ¢[G], then ¢ LN] 
is anormal subgroup of G. 


Theorem 15.16 should be viewed as saying that a homomorphism ¢: G > G’ 
preserves normal subgroups between G and ¢[G]. It is important to note that ¢[N] may 
not be normal in G’, even though N is normal in G. For example, ¢ : Z) — $3, where 
(0) = po and @(1) = 4, is a homomorphism, and Z, is a normal subgroup of itself, 
but {, 441} is not a normal subgroup of $3. 

We can now characterize when G/N is a simple group. 


A maximal normal subgroup of a group G is a normal subgroup M not equal to G 
such that there is no proper normal subgroup N of G properly containing M. a 
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15.18 Theorem 
Proof 


15.19 Example 


15.20 Theorem 


Homomorphisms and Factor Groups 


M is a maximal normal subgroup of G if and only if G/M is simple. 


Let M be a maximal normal subgroup of G. Consider the canonical homomorphism 
y :G— G/M given by Theorem 14.9. Now y~! of any nontrivial proper normal sub- 
group of G/M is a proper normal subgroup of G properly containing M. But M is 
maximal, so this can not happen. Thus G/M is simple. 

Conversely, Theorem 15.16 shows that if N is a normal subgroup of G properly 
containing M, then y[N] is normal in G/M. If also N # G, then 


yIN]#G/M = and sy [N] # {M}. 


Thus, if G/M is simple so that no such y[N] can exist, no such N can exist, and M is 
maximal. 4 


The Center and Commutator Subgroups 


Every nonabelian group G has two important normal subgroups, the center Z(G) of 
G and the commutator subgroup C of G. (The letter Z comes from the German word 
zentrum, meaning center.) The center Z(G) is defined by 


Z(G) = {z € G\zg = gz forall g € G}. 


Exercise 52 of Section 5 shows that Z(G) is an abelian subgroup of G. Since for each g € 
Gandz € Z(G) we have gzg—'! = zgg7! = ze = z, we see at once that Z(G) is anormal 
subgroup of G. If G is abelian, then Z(G) = G; in this case, the center is not useful. 


The center of a group G always contains the identity element e. It may be that Z(G) = {e}, 
in which case we say that the center of G is trivial. For example, examination of Table 8.8 
for the group 53 shows us that Z(.53) = {po}, so the center of 3 is trivial. (This is a special 
case of Exercise 38, which shows that the center of every nonabelian group of order pq 
for primes p and gq is trivial.) Consequently, the center of 5; x Zs must be {9} x Zs, 
which is isomorphic to Zs. A 


Turning to the commutator subgroup, recall that in forming a factor group of G 
modulo a normal subgroup N, we are essentially putting every element in G that isin N 
equal to e, for N forms our new identity in the factor group. This indicates another usc for 
factor groups. Suppose, for example, that we are studying the structure of a nonabelian 
group G. Since Theorem 11.12 gives complete information about the structure of all 
sufficiently small abelian groups, it might be of interest to try to form an abelian group 
as much like G as possible, an abelianized version of G, by starting with G and then 
requiring that ab = ba for all a and b in our new group structure. To require that ab = ba 
is to say that aba~'b~! =e in our new group. An element aba~'b~! in a group is a 
commutator of the group. Thus we wish to attempt to form an abelianized version of G 
by replacing every commutator of G by e. By the first observation of this paragraph, we 
should then attempt to form the factor group of G modulo the smallest normal subgroup 
we can find that contains all commutators of G. 


Let G bea group. The set of all commutators aba~'b | fora, b € G generates a subgroup 
C (the commutator subgroup) of G. This subgroup C is a normal subgroup of G. 
Furthermore, if N is anormal subgroup of G, then G/N is abelian if and only ifC < N. 


Proof 


15.21 Example 
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The commutators certainly generate a subgroup C; we must show that it is normal in 
G. Note that the inverse (aba~'b7!)~! of a commutator is again a commutator, namely, 
bab-‘a7'. Also e = eee~'e7! is a commutator. Theorem 7.6 then shows that C consists 
precisely of all finite products of commutators. For x € C, we must show that g~!xg € C 
for all g € G, or that if x is a product of commutators, so is g~!xg for all g € G. By 
inserting e = gg! between each product of commutators occurring in x, we see that it 
is sufficient to show for each commutator cdc7'd7! that g~!(ede~!d7')g is in C. But 


g ede dg = (gtede“!)\(ey(d7!g) 
=(g 'edce '\gd ‘dg '\d~'g) 
= [(g ‘e)d(g7'c)"'d~'I[dg7'd7' 1, 


which is in C. Thus C is normal in G. 
The rest of the theorem is obvious if we have acquired the proper feeling for factor 
groups. One doesn’t visualize in this way, but writing out that G/C is abelian follows from 


(aC)(bC) = abC = ab(b™!a7!ba)C 
= (abb—a7!)baC = baC = (bC)(aC). 
Furthermore, if NV is anormal subgroup of G and G/N is abelian, then (a N\b-!N) = 
(b-1N)(a-1N); that is, aba~!b-1N = N, so aba~'b7! € N, and C < N. Finally, if 
C <N, then 
(aN\(bN) = abN = ab(b“'a|ba)N 


= (abb-'a')baN = baN = (bN\(aN). ‘ 
For the group $3 in Table 8.8, we find that one commutator is piltipy {ey = Pf P21 
= [13[l2 = p. We similarly find that p.y1py' wy! = pri pil = fla3 = f1. Thus the 
commutator subgroup C of $3 contains A3. Since A3 is a normal subgroup of $3 and 
S3/A3 is abelian, Theorem 15.20 shows that C = A3. A 


@ EXERCISES 15 


Computations 


In Exercises 1 through 12, classify the given group according to the fundamental theorem of finitely generated 
abelian groups. 


» (Zz x Za)/((O, 1)) 
» (Zy x Z4)/ (CA, 2)) 
. (Z4 X Z4 x Zg)/((, 2, 4)) 


- (Zx Z)/((, 2)) 


» (Zy x Z4)/((0, 2)) 

» (Ze x Zg)/(C, 2)) 

» (Z x Z)/((0, 1) 

» (ZX Zx Z)/((, 1,1) 


on kN 


» x 2x Za4)/(G3, 0, 9)) 10. (Z x Z x Zg)/((0, 4, 0)) 


» Zx Z)/{Q, 2)) 


12. (2x Zx Z)/((3, 3, 3)) 
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14. 
15. 
16. 


Part II] Homomorphisms and Factor Groups 


Find both the center Z(D4) and the commutator subgroup C of the group D4 of symmetries of the square in 
Table 8.12. 


Find both the center and the commutator subgroup of Z3 x 33. 
Find both the center and the commutator subgroup of $3 x D4. 


Describe all subgroups of order < 4 of Z x Z4, and in each case classify the factor group of Z4 x Z4 modulo 
the subgroup by Theorem 11.12. That is, describe the subgroup and say that the factor group of Z, x Z4 modulo 
the subgroup is isomorphic to Z2 x Z4, or whatever the case may be. [Hint: Z, x Z, has six different cyclic 
subgroups of order 4. Describe them by giving a generator, such as the subgroup ((1, 0)). There is one subgroup 
of order 4 that is isomorphic to the Klein 4-group. There are three subgroups of order 2.] 


Concepts 


In Exercises 17 and 18, correct the definition of the italicized term without reference to the text, if correction is 
needed, so that it is in a form acceptable for publication. 


17, 
18. 
19. 


The center of a group G contains all elements of G that commute with every element of G. 
The commutator subgroup of a group G is {a~'b-!ab| a, b € G}. 
Mark each of the following true or false. 


a. Every factor group of a cyclic group is cyclic. 


b. A factor group of a noncyclic group is again noncyclic. 

c. R/Z under addition has no element of order 2. 

d. R/Z under addition has elements of order n for alln € ZT. 

e. R/Z under addition has an infinite number of elements of order 4. 

f. If the commutator subgroup C of a group G is {e}, then G is abelian. 
g. 
h, 


If G/H is abelian, then the commutator subgroup of C of G contains H. 
The commutator subgroup of a simple group G must be G itself. 


i. The commutator subgroup of a nonabelian simple group G must be G itself. 


j. All nontrivial finite simple groups have prime order. 


In Exercises 20 through 23, let F be the additive group of all functions mapping R into R, and let F* be the 
multiplicative group of all elements of F that do not assume the value 0 at any point of R. 


20. 


21. 


22. 


23. 


Let K be the subgroup of F consisting of the constant functions. Find a subgroup of F to which F/K is 
isomorphic. 


Let K* be the subgroup of F* consisting of the nonzero constant functions. Find a subgroup of F* to which 
F*/K* is isomorphic. 

Let K be the subgroup of continuous functions in F. Can you find an element of F/K having order 2? Why 
or why not? 


Let K* be the subgroup of F* consisting of the continuous functions in F*. Can you find an element of F*/K* 
having order 2? Why or why not? 


In Exercises 24 through 26, let U be the multiplicative group {z € C | |z| = 1}. 


24. 
25. 
26. 
27. 


Let z € U. Show that zpU = {zoz|z € U} is a subgroup of U, and compute U/zgU. 
To what group we have mentioned in the text is U/(—1) isomorphic? 
Let ¢, = cos(2x/n) +i sin(2x/n) where n € Zt. To what group we have mentioned is U/(¢,) isomorphic? 


To what group mentioned in the text is the additive group R/Z isomorphic? 


28. 


29. 


30. 


31. 
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Give an example of a group G having no elements of finite order > 1 but having a factor group G/A, all of 
whose elements are of finite order. 


Let H and K be normal subgroups of a group G. Give an example showing that we may have H ~ K while 
G/H is notisomorphic to G/K. 


Describe the center of every simple 


a. abelian group 
b. nonabelian group. 


Describe the commutator subgroup of every simple 


a. abelian group 
b. nonabelian group. 


Proof Synopsis 


32. 
33. 


Give a one-sentence synopsis of the proof of Theorem 15.9. 


Give at most a two-sentence synopsis of the proof of Theorem 15.18. 


Theory 


34. 
35. 


36. 


37. 


38. 
39. 


Show that if a finite group G contains a nontrivial subgroup of index 2 in G, then G is not simple. 


Let ¢ : G ~ G’ be a group homomorphism, and let N be a normal subgroup of G. Show that é[.N] is normal 
subgroup of é[G]. 


Let ¢ : G > G’ be a group homomorphism, and let N’ be a normal subgroup of G'. Show that @![N’] is a 
normal subgroup of G. 


Show that if G is nonabelian, then the factor group G/Z(G) is not cyclic. [Hint: Show the equivalent contra- 
positive, namely, that if G/Z(G) is cyclic then G is abelian (and hence Z(G) = G).] 


Using Exercise 37, show that a nonabelian group G of order pg where p and gq are primes has a trivial center. 

Prove that A, is simple for n > 5, following the steps and hints given. 

a. Show A, contains every 3-cycle ifn > 3. 

b. Show A, is generated by the 3-cycles for n > 3. [Hint: Note that (a, b)(c, d) = (a, c, b)(a, c,d) and 
(a, c)(a, b) = (a, b, c).] 

c. Let r and s be fixed elements of {1,2,---,} for n > 3. Show that A, is generated by the n “special” 
3-cycles of the form (r, s,i) for 1 <i <n [Hint: Show every 3-cycle is the product of “special” 3-cycles 
by computing 

CL Gelaal, Gees, 
and 


GPAy as OO SPY (S21). 
Observe that these products give all possible types of 3-cycles.] 
d. Let N be anormal subgroup of A,, forn > 3. Show thatif N contains a 3-cycle, then N = A,. [Hint: Show 
that (r, s,i) € N implies that (7, s, 7) € N for 7 = 1,2, +++, by computing 
G.9G MOSH IED] 


e. Let N be a nontrivial normal subgroup of A, for n > 5. Show that one of the following cases must hold, 
and conclude in each case that N = Ay. 
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Casel 


Case IT 


Case Il 


Case IV 


Case V 


N contains a 3-cycle. 


N contains a product of disjoint cycles, at least one of which has length greater than 3. [Hint: Suppose 
N contains the disjoint product 0 = pe(ay, az, +++, a). Show 0 !(ay, a2, a3)0(a1, a2, a3)! isin N, 
and compute it.] 

N contains a disjoint product of the form o = p4(a4, a5, a6)(a1, 42, 43). [Hint: Show ao (a), a2, a4) 
o(a), a2, a4)~! is in N, and compute it.] 

N contains a disjoint product of the forma = (ay, a2, a3) where yz is a product of disjoint 2-cycles. 
[Hint: Show o? € N and compute it.] 


N contains a disjoint product o of the form o = j4(a3, a4)(a;, a2), where yz is a product of an even 
number of disjoint 2-cycles. [Hint: Show that ao !(ay, a2, 43)0 (a), a2, 23)! is in N, and compute 
it to deduce that a = (a2, a4)(a;, 43) is in N. Using n > 5 for the first time, find i 4 ay, do, a3, a4 
in {1,2,.--,n}. Let 8 = (a1, a3, i). Show that BolaBa € N, and compute it.] 


40. Let N be a normal subgroup of G and let H be any subgroup of G. Let HN = {hn|A ¢ H,n € N}. Show that 
FIN is asubgroup of G, and is the smallest subgroup containing both N and H. 


41. With reference to the preceding exercise, let M also be a normal subgroup of G. Show that NM is again a 
normal subgroup of G. 


42. Show that if H and K are normal subgroups of a group G such that HM K = {e}, then hk = kh forallh ¢ H 
and k € K. [Hint: Consider the commutator hkh-1k7! = (hkh-)k7! = hkh-'k}).] 


'Grour ACTION ON A SET 


We have seen examples of how groups may act on things, like the group of symmetries 
of a triangle or of a square, the group of rotations of a cube, the general linear group 
acting on R", and so on. In this section, we give the general notion of group action on a 
set. The next section will give an application to counting. 


The Notion of a Group Action 


Definition 2.1 defines a binary operation * on a set S to be a function mapping S x $ 
into S. The function * gives us a rule for “multiplying” an element s, in S and an element 
s2 in S to yield an element sj * s2 in S. 

More generally, for any sets A, B, and C, we can view a map *: A x B > C as 
defining a “multiplication,” where any element a of A times any element b of B has as 
value some element c of C. Of course, we write a * b = c, or simply ab = c. In this 
section, we will be concerned with the case where X is a set, G is a group, and we have 
amap*«: Gx X — X. We shall write *(g, x) as g * x or gx. 


16.1 Definition Let X be a set and G a group. An action of G on X isamap * : G x X — X such that 


| 
1. ex =x forallx € X, 
2. (g192)(¢) = g1(g2x) for all x € X and all 91, g2 EG. 


Under these conditions, X is a G-set. 


t This section is a prerequisite only for Sections 17 and 36. 


16.2 Example 


16.3 Theorem 


Proof 
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Let X be any set, and let H be a subgroup of the group Sy of all permutations of X. 
Then X is an H-set, where the action of o € H on X is its action as an element of 
Sy, so that ox = o(x) for all x € X. Condition 2 is a consequence of the definition of 
permutation multiplication as function composition, and Condition 1 is immediate from 
the definition of the identity permutation as the identity function. Note that, in particular, 
{1,2,3,:.--,n}is an S, set. A 


Our next theorem will show that for every G-set X and each g € G, the map 
o, : X + X defined by o,(x) = gx is a permutation of X, and that there is a homomor- 
phism ¢ : G —> Sx such that the action of G on X is essentially the Example 16.2 action 
of the image subgroup H = ¢[G] of Sy on X. So actions of subgroups of Sx on X de- 
scribe all possible group actions on X. When studying the set X, actions using subgroups 
of Sy suffice. However, sometimes a set X is used to study G via a group actionof Gon X. 
Thus we need the more general concept given by Definition 16.1. 


Let X be a G-set. For each g € G, the function o, : X — X defined by o,(x) = gx 
for x € X is a permutation of X. Also, the map ¢ : G > Sx defined by ¢(g) = o, isa 
homomorphism with the property that @(g)(x) = gx. 


To show that o, is a permutation of X, we must show that it is a one-to-one map 
of X onto itself. Suppose that o9(x1) = og(x2) for x1, x2 € X. Then gx; = gx2. Con- 
sequently, g~!(gx1) = g '(gx2). Using Condition 2 in Definition 16.1, we see that 
(g-!g)x1 = (g~'g)x2, SO ex) = ex. Condition 1 of the definition then yields x; = x2, 
SO dy is one to one. The two conditions of the definition show that for x € X, we have 
Og(g 'Xx) = g(g7!)x = (gg7!)x = ex =x, 80 0, maps X onto X. Thus oy is indeed a 
permutation. 

To show that 6: G — Sy defined by @(g) = og is a homomorphism, we must 
show that $(g1¢2) = @(g1)@(g2) for all g1, g2 € G. We show the equality of these two 
permutations in Sy by showing they both carry an x € X into the same element. Us- 
ing the two conditions in Definition 16.1 and the rule for function composition, we 
obtain 


P(21 22) X) = Ge,9.(X) = (9182)% = B1( 82x) = 210g, (X) = Fg, (Oy,(%)) 
= (04,0 Og, )(X) = (Fg, Fg, (x) = (H(81)6(82))(X). 


Thus ¢ is a homomorphism. The stated property of @ follows at once since by our 
definitions, we have @(g)(x) = o,(x) = gx. a 


it follows from the preceding theorem and Theorem 13.15 that if X is G-set, then 
the subset of G leaving every element of X fixed is a normal subgroup N of G, and we 
can regard X as a G/N-set where the action of a coset gN on X is given by (geN)x = gx 
for eachx € X.If N = {e}, then the identity element of G is the only element that leaves 
every x € X fixed; we then say that G acts faithfully on X. A group G is transitive on 
a G-set X if for each x1, x. € X, there exists g € G such that gx; = x2. Note that G is 
transitive on X if and only if the subgroup @[G] of Sy is transitive on X, as defined in 
Exercise 49 of Section 8. 

We continue with more examples of G-sets. 
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Part II 


16.4 Example 


16.5 Example 


16.6 Example 


16.7 Example 


16.8 Example 


Homomorphisms and Factor Groups 


Every group G is itself a G-set, where the action on g2 € G by g; € G is given by left 
multiplication. That is, *(g1, 22) = 2122. If H is asubgroup of G, we can also regard G 
as an H-set, where #(h. g) = hg. A 


Let H be asubgroup of G. Then G is an H-set under conjugation where *(h, g) = hgh} 
for g € Gandh € H. Condition 1 is obvious, and for Condition 2 note that 


#(hyho, g) = (hihz)g(hihr) * = hi(haghy')hy! = *(1, *(A2, 8). 


We always write this action of H on G by conjugation as hgh~'. The abbreviation hg 
described before the definition would cause terrible confusion with the group operation 
of G. A 


For students who have studied vector spaces with real (or complex) scalars, we mention 
that the axioms (rs)v = r(sv) and 1v = v for scalars r and s and a vector v show that 
the set of vectors is an R*-set (or a C*-set) for the multiplicative group of nonzero 
scalars. A 


Let H be a subgroup of G, and let Ly be the set of all left cosets of H. Then Ly is 
a G-set, where the action of g € G on the left coset x7 is given by g(4H) = (gx)H. 
Observe that this action is well defined: if yH =xH, then y = xh for some h € H, 
and g(yH) = (gy)H = (exh) = (gx)(hH) = (gx)H = g(xfA). A series of exercises 
shows that every G-set is isomorphic to one that may be formed using these left coset 
G-sets as building blocks. (See Exercises 14 through 17.) A 


Let G be the group D4 = {o, (1, 02. 03, [lt, [42, 51. 62} of symmetries of the square, 
described in Example 8.10. In Fig. 16.9 we show the square with vertices 1, 2, 3, 4 as 
in Fig. 8.11. We also label the sides 51, 52, 93, 84, the diagonals d, and d2, vertical and 
horizontal axes m1 and mo, the center point C, and midpoints P; of the sides s;. Recall 
that p; corresponds to rotating the square counterclockwise through wi/2 radians, 4; 


P3 53 
4 3 
$4 
P, P; 
SQ 
1 3 
Sy Py 


16.9 Figure 
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16.10 Table 

1 2 3 4 s sp ss; Sa m, mz ad dy C P, Pr Py, Py 
p 1 2 3 4 5; So 53 Sy omy, my dy dy C P, Pr Py Py 
pi 2 3 4 1 gs 83 Se SS; Mz m, dy Ay C Py Ps BF PB 
po 3 4 1 2 S53 S4 Sy So my Mo dy ad Cc P3 Py Pi Py, 
P3 4 1 2 3 S4 Ss] AY) S33 Wha my dy d; C P, P, P, P3 
By 2 1 4 3 S} S4 53 S52 My M2 ay a; C P, P, P, P, 
My 4 3 2 1 s3 sy sy Sy omy mz ds dy C P,; Py Pi Ps 
6, 3 2 1 4 AY) Sy S4 S53 Mo my, a, dy C Py, P; Py P3 
by 1 4 3 2 S54 S3 S2 AMI Ma My dy ad Cc Py P3 P, P, 


corresponds to flipping on the axis m;, and 6; to flipping on the diagonal d;. We let 
X = {1, 2,3. 4, 51, $2, 53, 84, m1, m2, dy, dz, C, P, Po, P3, Pa}. 


Then X can be regarded as a D4-set inanatural way. Table 16.10 describes completely the 
action of D4 on X and is given to provide geometric illustrations of ideas to be introduced. 
We should be sure that we understand how this table is formed before continuing. A 


Isotropy Subgroups 


Let X be a G-set. Let x € X and g € G. It will be important to know when gx = x. We 
let 


Xg={xEeX|gx =x} and Gy, ={g € G| gx =x}. 


16.11 Example For the D4-set X in Example 16.8, we have 
Xp. =X, Xp, = {C}, Xp, = 181, 53.1, M2, C, Pi, P3} 
Also, with G = Dg, 
G1 = {p0, 62}, Gs, = {n, Li}. Ga, = {P0. 2.51, 82}. 
We leave the computation of the other X, and G,, to Exercises 1] and 2. A 


Note that the subsets G, given in the preceding example were, in each case, sub- 
groups of G. This is true in general. 


16.12 Theorem Let X be a G-set. Then G, is a subgroup of G for each x € X. 


Proof Let x € X and let gy, g2 € G,. Then gyx = x and g2x = x. Consequently, (g1g2)x = 
21(g2X) = 21X = X,$0 212) € G,, and G, is closed under the induced operation of G. Of 
course ex = x, soe € G,. If g € G,, then gx =x, sox =ex =(g7!g)x = g7 (gx) = 
g—'x, and consequently g~! € G,. Thus G, is a subgroup of G. ° 


16.13 Definition Let X be a G-set and let x ¢ X. The subgroup G, is the isotropy subgroup of x. 
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16.14 Theorem 


Proof 


16.15 Definition 


16.16 Theorem 


Proof 


Homomorphisms and Factor Groups 


Orbits 


For the Dy-set X of Example 16.8 with action table in Table 16.10, the elements in the 
subset {1, 2, 3, 4} are carried into elements of this same subset under action by D4. 
Furthermore, each of the elements 1, 2, 3, and 4 is carried into all the other elements of 
the subset by the various elements of D4. We proceed to show that every G-set X can 
be partitioned into subsets of this type. 


Let X be a G-set. For x1, € X, let x; ~ x2 if and only if there exists g ¢ G such that 
gx = x. Then ~ is an equivalence relation on Xx. 


For each x € X, we have ex = x, so x ~ x and ~ is reflexive. 
Suppose X1 ~~ 22, SO gx; =X. for some g €G. Then g 
(g—!g)x, = ex, = Xj, 90%) ~ x1, and ~ is symmetric. 
Finally, if x; ~ x2 and x2 ~ x3, then gix1 = x2 and gox2 = x3 for some g}, g2 € G. 
Then (9221)%1 = 92(g1%1) = 22%2 = X3, SO X1 ~ %3 and ~ is transitive. 


he =p ek) = 


Let X be a G-set. Each cell in the partition of the equivalence relation described in 
Theorem 16.14 is an orbit in X under G. If x € X, the cell containing x is the orbit 
of x. We let this cell be Gx. | 


The relationship between the orbits in X and the group structure of G lies at the 
heart of the applications that appear in Section 17. The following theorem gives this 
relationship. Recall that for a set X, we use |X| for the number of elements in X, and 
(G : H) is the index of a subgroup H ina group G. 


Let X be a G-set and let x € X. Then |Gx| =(G : G,). If |G| is finite, then |Gx| is a 
divisor of |G]. 


We define a one-to-one map y from Gx onto the collection of left cosets of G, in G. 
Let x; € Gx. Then there exists g; € G such that gx = x). We define y(x;) to be the 
left coset |G, of G,,. We must show that this map 7 is well defined, independent of the 
police of g, € : such that g1x = x. Suppose also that 81 'x = x,. Then, git'= — 81 'x, SO 
81 (gix) = = 2) “ig ‘x), from which we deduce x = (gi gx. Therefore g7 ta; € Gy, 
30 g1' € g1Gy, and giGy = gi; G,. Thus the map w is well defined. 

To show the map y is one to one, suppose x), x2 € Gx, and Wy) = w (x2). Then 
there exist g}, g2 € G such that x) = gix, x2 = gox, and go € giG,. Then gz = 218 for 
some g € Gy, 80 x7 = Box = gy(gx) = Bix = x1. Thus y is one to one. 

Finally, we show that each left coset of G, in G is of the form w(x.) for some 
x, € Gx. Let g,G, be a left coset. Then if gx = x1, we have g)Gy = W(1). Thus w 
maps Gx one to one onto the collection of right cosets so |Gx| = (G: Gx). 

If [G| is finite, then the equation |G| = |G,|(G : G,.) shows that |Gx| = (G : Gx) 
is a divisor of |G]. ¢ 


Section 16 Exercises 159 


16.17 Example Let X be the Dy-set in Example 16.8, with action table given by Table 16.10. With 
G = D4, wehave G1 = {1, 2, 3, 44and G; = {(0, 62}. Since |G| = 8, we have |G1| = 
(G:G)) =4. A 
We should remember not only the cardinality equation in Theorem 16.16 but also 
that the elements of G carrying x into 91x are precisely the elements of the left coset 
giGx. Namely, if g € G,, then(g) g)x = gi(gx) = gix. On the other hand, if g2x = gix, 
then 27 (g2x) =x so (gy 'g2)x = x. Thus 8) 22 & Gy, 80 go € g1Gy. 


@ EXERCISES 16 


Computations 
In Exercises 1 through 3, let 
X =({1, 2, 3, 4, 51,52, 53, $4.71, M2, dj, do, C, P|, Po, P3, Pa} 
be the D4-set of Example 16.8 with action table in Table 16.10. Find the following, where G = D4. 
1. The fixed sets X, foreacho € Dg, that is, Xp,,Xp,.+++, Xs, 
2. The isotropy subgroups G, for each x € X, that is, G,, Go,---, Gp,, Gp, 
3. The orbits in X under Dy 


Concepts 
In Exercises 4 and 5, correct the definition of the italicized term without reference to the text, if correction is needed, 
so that it is in a form acceptable for publication. 

4, A group G acts faithfully on X if and only if gx = x implies that g = e¢. 

5. A group G is transitive on a G-set X if and only if, for some g € G, gx can be every other x. 

6. Let X be a G-set and let S C X. If Gs C S forall s € S, then S is a sub-G-set. Characterize a sub-G-set of a 

G-set X in terms of orbits in X and G. 
7, Characterize a transitive G-set in terms of its orbits. 
8. Mark each of the following true or false. 


a. Every G-set is also a group. 

b. Each element of a G-set is left fixed by the identity of G. 

c. If every element of a G-set is left fixed by the same element g of G, then g must be the identity e. 
d. Let X be a G-set with x},.x. € X and g € G.If gx) = gx2, then x; = x2. 

e. Let X be a G-set with x € X and g), g. € G. If gx = gox, then g; = go. 

f. Each orbit of a G-set X is a transitive sub-G-set. 

g. Let X be a G-set and let H < G. Then X can be regarded in a natural way as an H-set. 

h. With reference to (g), the orbits in X under H are the same as the orbits in X under G. 

i. If X is a G-set, then each element of G acts as a permutation of X. 

j. Let X be a G-set and let x ¢ X. If G is finite, then |G] = |Gx| + |G,]. 

9. Let X and Y be G-sets with the same group G. An isomorphism between G-sets X and Y isamap¢: X > Y 


that is one to one, onto Y, and satisfies g4(x) = (gx) for all x € X and g € G. Two G-sets are isomorphic 
if such an isomorphism between them exists. Let X be the D4-set of Example 16.8. 
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Part IIT Homomorphisms and Factor Groups 


a. Find two distinct orbits of X that are isomorphic sub-D.-sets. 


b. Show that the orbits {1, 2, 3, 4} and {s, 52, 53, sq} are not isomorphic sub-D,-sets. [Hint: Find an element 
of G that acts in an essentially different fashion on the two orbits.] 


c. Are the orbits you gave for your answer to part (a) the only two different isomorphic sub-D,4-sets of X? 
Let X be the D4-set in Example 16.8. 

a. Does Daz act faithfully on X? 

b. Find all orbits in X on which Dg acts faithfully as a sub- D4-sct. 


Theory 


11. 


12. 


13. 


Let X be a G-set. Show that G acts faithfully on X if and only if no two distinct elements of G have the same 
action on each element of X. 


Let X be a G-set and let Y C X¥. Let Gy = {g € G| gy = y for all y € Y}. Show Gy is a subgroup of G, 
generalizing Theorem 16.12. 


Let G be the additive group of real numbers. Let the action of 9 € G on the real plane R? be given by rotating 
the plane counterclockwise about the origin through @ radians. Let P be a point other than the origin in the 
plane. 

a. Show R? is a G-set. 

b. Describe geometrically the orbit containing P. 

c. Find the group Gp. 


Exercises 14 through 17 show how all possible G-sets, up to isomorphism (see Exercise 9), can be formed from 
the group G. 


14. 


15. 
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17. 


Let {X; |i € J} be a disjoint collection of sets, so X; 1 X; = @ fori # j. Let each X; be a G-set for the same 
group G. 


a. Show that _),;.,X; can be viewed in a natural way as a G-set, the union of the G-sets X;. 
b. Show that every G-set X is the union of its orbits. 


Let X be a transitive G-set, and let x9 € X. Show that X is isomorphic (see Exercise 9) to the G-set Z of all 
left cosets of G,,, described in Example 16.7. [Hint: For x € X, suppose x = gxo, and define @ : X —> L by 
(x) = gG,,. Be sure to show ¢ is well defined!] 


Let X; for i ¢ J be G-sets for the same group G, and suppose the sets X; are not necessarily disjoint. Let 
X! = {(x,i)|x € X;} for eachi € J. Then the sets X; are disjoint, and each can still be regarded as a G-set in 
an obvious way. (The elements of X; have simply been tagged by i to distinguish them from the elements of 
X; fori # j.) The G-set LJ,_.,X; is the disjoint union of the G-sets X;. Using Exercises 14 and 15, show that 
every G-set is isomorphic to a disjoint union of left coset G-sets, as described in Example 16.7. 


The preceding exercises show that every G-set X is isomorphic to a disjoint union of left coset G-sets. The 
question then arises whether left coset G-sets of distinct subgroups H and K of G can themselves be isomorphic. 
Note that the map defined in the hint of Exercise 15 depends on the choice of xo as “base point.” If xo is replaced 
by goxo andif G,, A Ggqx., then the collections L y of left cosets of H = G,, and Lx ofleft cosets of K = G,,x, 
form distinct G-sets that must be isomorphic, since both Ly and Lx are isomorphic to X. 


a. Let X be a transitive G-set and let x» € X and go ¢ G. If H = G,, describe K = G,,,, in terms of H 
and go. 

b. Based on part (a), conjecture conditions on subgroups H and K of G such that the left coset G-sets of H 
and K are isomorphic. 


c. Prove your conjecture in part (b). 
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18. Up to isomorphism, how many transitive Z4 sets X are there? (Use the preceding exercises.) Give an example 
of each isomorphism type, listing an action table of each as in Table 16.10. Take lowercase names a, b, c, and 
so on for the elements in the set X. 


19, Repeat Exercise 18 for the group Ze. 


20. Repeat Exercise 18 for the group $3. List the elements of 53 in the order ¢, (1, 2, 3), (1, 3, 2), (2, 3), (1, 3), 
(1, 2). 


17.1 Theorem 


t APPLICATIONS OF G-SETS TO COUNTING 


This section presents an application of our work with G-sets to counting. Suppose, for 
example, we wish to count how many distinguishable ways the six faces of a cube can 
be marked with from one to six dots to form a die. The standard die is marked so that 
when placed on a table with the 1 on the bottom and the 2 toward the front, the 6 is on 
top, the 3 on the left, the 4 on the right, and the 5 on the back. Of course, other ways of 
marking the cube to give a distinguishably different die are possible. 

Let us distinguish between the faces of the cube for the moment and call them the 
bottom, top, left, right, front, and back. Then the bottom can have any one of six marks 
from one dot to six dots, the top any one of the five remaining marks, and so on. There 
are 6! = 720 ways the cube faces can be marked in all. Some markings yield the same 
die as others, in the sense that one marking can be carried into another by a rotation 
of the marked cube. For example, if the standard die described above is rotated 90° 
counterclockwise as we look down on it, then 3 will be on the front face rather than 2, 
but it is the same die. 

There are 24 possible positions of a cube on a table, for any one of six faces can be 
placed down, and then any one of four to the front, giving 6 . 4 = 24 possible positions. 
Any position can be achieved from any other by a rotation of the die. These rotations 
form a group G, which is isomorphic to a subgroup of Sg (see Exercise 45 of Section 8). 
We let X be the 720 possible ways of marking the cube and let G act on X by rotation of 
the cube. We consider two markings to give the same die if one can be carried into the 
other under action by an element of G, that is, by rotating the cube. In other words, we 
consider each orbit in X under G to correspond to a single die, and different orbits to 
give different dice. The determination of the number of distinguishable dice thus leads 
to the question of determining the number of orbits under G in a G-set X. 

The following theorem gives a tool for determining the number of orbits in a G- 
set X under G. Recall that for each g € G we let X, be the set of elements of X left 
fixed by g, so that X, = {x € X| gx =x}. Recall also that for each x € X, we let 
G, ={g € G| gx = x}, and Gx is the orbit of x under G. 


(Burnside’s Formula) Let G be a finite group and X a finite G-set. If r is the number 
of orbits in X under G, then 


r-|Gl= >> [Xe () 


geG 


1 This section is not used in the remainder of the text. 
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Part II 


Proof 


17.2 Corollary 


Proof 


17.3 Example 


Homomorphisms and Factor Groups 


We consider all pairs (g, x) where gx = x, and let NV be the number of such pairs. For 
each g € G there are |X,| pairs having g as first member. Thus, 


N=) |Xel- (2) 


geG 


On the other hand, foreach x € X there are |G,| pairs having x as second member. Thus 


we also have 
N= y, IG, |. 


xEX 


By Theorem 16.16 we have |Gx| =(G: G,). But we know that (G : Gy) = |G\/|Gxl, 
so we obtain |G,| = |G|/|Gx|. Then 


SE ie 
N= ies 10 gq) (3) 


Now 1/|Gx| has the same value for all x in the same orbit, and if we let O be any orbit, 
then 


i 1 
wae Lee (4) 


xeO xO 
Substituting (4) in (3), we obtain 
N = |G| (number of orbits in X under G) = |G| +r. (5) 
Comparison of Eq. 2 and Eq. 5 gives Eq. 1. + 


If G is a finite group and X is a finite G-set, then 


ws 1 
(number of orbits in X under G) = —_ - » |X QI. 
IG] 2% 
The proof of this corollary follows immediately from the preceding theorem. Sd 


Let us continue our computation of the number of distinguishable dice as our first 
example. 


We let X be the set of 720 different markings of faces of a cube using from one to six 
dots. Let G be the group of 24 rotations of the cube as discussed above. We saw that the 
number of distinguishable dice is the number of orbits in X under G. Now |G| = 24. 
For g € G where g # e, we have |X,| = 0, because any rotation other than the identity 
element changes any one of the 720 markings into a different one. However, |X.| = 720 
since the identity element leaves all 720 markings fixed. Then by Corollary 17.2, 


1 
(number of orbits) = ve 720 = 30, 


so there are 30 distinguishable dice. A 


17.4 Example 


17.5 Example 


17.6 Example 
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Of course the number of distinguishable dice could be counted without using the 
machinery of the preceding corollary, but by using elementary combinatorics as often 
taught in a freshman finite math course. In marking a cube to make a die, we can, 
by rotation if necessary, assume the face marked | is down. There are five choices 
for the top (opposite) face. By rotating the die as we look down on it, any one of 
the remaining four faces could be brought to the front position, so there are no different 
choices involved for the front face. But with respect to the number on the front face, there 
are3 -2- 1 possibilities for the remaining three side faces. Thus there are5 . 3-2-1 = 30 
possibilities in all. 

The next two examples appear in some finite math texts and are casy to solve by 
elementary means. We use Corollary 17.2 so that we have more practice thinking in 
terms of orbits. 


How many distinguishable ways can seven people be seated at a round table, where 
there is no distinguishable “head” to the table? Of course there are 7! ways to assign 
people to the different chairs. We take X to be the 7! possible assignments. A rotation of 
people achieved by asking each person to move one place to the right results in the same 
arrangement. Such a rotation generates a cyclic group G of order 7, which we consider 
to act on X in the obvious way. Again, only the identity e leaves any arrangement fixed, 
and it leaves all 7! arrangements fixed. By Corollary 17.2 


1 
(number of orbits) = 5" 7! = 6! = 720. A 


How many distinguishable necklaces (with no clasp) can be made using seven different- 
colored beads of the same size? Unlike the table in Example 17.4, the necklace can be 
turned over as well as rotated. Thus we consider the full dihedral group D7 of order 
2-7 = 14 as acting on the set X of 7! possibilities. Then the number of distinguishable 
necklaces is 


1 
(number of orbits) = i 7! = 360. A 


In using Corollary 17.2, we have to compute {G| and |X,| for each g € G. In the 
examples and the exercises, |G| will pose no real problem. Let us give an example where 
|X| is not as trivial to compute as in the preceding examples. We will continue to assume 
knowledge of very elementary combinatorics. 


Let us find the number of distinguishable ways the edges of an equilateral triangle can 
be painted if four different colors of paint are available, assuming only one color is used 
on each edge, and the same color may be used on different edges. 

Of course there are 4° = 64 ways of painting the edges in all, since each of the 
three edges may be any one of four colors. We consider X to be the set of these 64 
possible painted triangles. The group G acting on X is the group of symmetries of the 
triangle, whichis isomorphic to S3 and which we consider to be $3. We use the notation for 
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17.7 Example 


Homomorphisms and Factor Groups 


elements in S; given in Section 8. We need to compute |X, | for each of the six elements g 
in S3 : 


X po| = 04 Every painted triangle is left fixed by pp. 

Xp,|=4 To be invariant under (, all edges must be the 
same color, and there are 4 possible colors. 

Xp,|= 4 Same reason as for (1. 

Xu,| = 16 The edges that are interchanged must be the same 


color (4 possibilities) and the other edge may 
also be any of the colors (times 4 possibilities). 


Xu;| = |Xu;| = 16 Same reason as for (41. 


Then 
So [Xp] = 64 +444 + 16+ 16 + 16 = 120. 
&ES3 
Thus 
1 
(number of orbits) = ra 120 = 20, 
and there are 20 distinguishable painted triangles. A 


We repeat Example 17.6 with the assumption that a different color is used on each edge. 
The number of possible ways of painting the edges is then 4 - 3 . 2 = 24, and we let X be 
the set of 24 possible painted triangles. Again, the group acting on X can be considered 
to be $3. Since all edges are a different color, we see |X,,| = 24 while |X,| = 0 for 


g & po. Thus 
1 
(number of orbits) = 6. 24 =4, 


so there are four distinguishable triangles. A 


@ EXERCISES 17 


Computations 


In each of the following exercises use Corollary 17.2 to work the problem, even though the answer might be obtained 
by more elementary methods. 


1. 
2. 
3. 


Find the number of orbits in {1, 2, 3, 4, 5, 6, 7, 8} under the cyclic subgroup ((1, 3, 5, 6)) of Sg. 
Find the number of orbits in {1, 2, 3, 4, 5, 6, 7, 8} under the subgroup of Sy generated by (1, 3) and (2, 4, 7). 


Find the number of distinguishable tetrahedral dice that can be made using one, two, three, and four dots on the 
faces of a regular tetrahedron, rather than a cube. 


. Wooden cubes of the same size are to be painted a different color on each face to make children’s blocks. How 
many distinguishable blocks can be made if eight colors of paint are available? 
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. Answer Exercise 4 if colors may be repeated on different faces at will. |Hint: The 24 rotations of a cube consist 


of the identity, 9 that leave a pair of opposite faces invariant, 8 that leave a pair of opposite vertices invariant, 
and 6 leaving a pair of opposite edges invariant.] 


Each of the eight corners of a cube is to be tipped with one of four colors, each of which may be used on from 
one to all eight corners. Find the number of distinguishable markings possible. (See the hint in Exercise 5.) 


. Find the number of distinguishable ways the edges of a square of cardboard can be painted if six colors of paint 


are available and 


a. no color is used more than once. 
b. the same color can be used on any number of edges. 


. Consider six straight wires of equal lengths with ends soldered together to form edges of a regular tetrahedron. 


Either a 50-ohm or 100-ohm resistor is to be inserted in the middle of each wire. Assume there are at least six 
of each type of resistor available. How many essentially different wirings are possible? 


. Arectangular prism 2 ft long with 1-ft square ends is to have each of its six faces painted with one of six possible 


colors. How many distinguishable painted prisms are possible if 


a. no color is to be repeated on different faces, 
b. each color may be used on any number of faces? 


Rings and Fields 


18.1 Definition 


Section 18 — Rings and Fields 

Section 19 = Integral Domains 

Section 20 Fermat's and Euler’s Theorems 

Section 21 ‘The Field of Quotients of an Integral Domain 
Section 22 Rings of Polynomials 

Section 23 Factorization of Polynomials over a Field 
Section 24 ‘Noncommutative Examples 

Section 25 ‘Ordered Rings and Fields 


RINGS AND FIELDS 


All our work thus far has been concerned with sets on which a single binary operation 
has been defined. Our years of work with the integers and real numbers show that a study 
of sets on which two binary operations have been defined should be of great importance. 
Algebraic structures of this type are introduced in this section. In one sense, this section 
seems more intutive than those that precede it, for the structures studied are closely 
related to those we have worked with for many years. However, we will be continuing 
with our axiomatic approach. So, from another viewpoint this study is more complicated 
than group theory, for we now have two binary operations and more axioms to deal with. 


Definitions and Basic Properties 


The most general algebraic structure with two binary operations that we shall study is 
called a ring. As Example 18.2 following Definition 18.1 indicates, we have all worked 
with rings since grade school. 


A ring (R,+,-) is a set R together with two binary operations + and -, which we call 
addition and multiplication, defined on R such that the following axioms are satisfied: 
H,. (R, +) is an abelian group. 
#,. Multiplication is associative. 


#;. Foralla, b,c € R, the left distributive law, a - (6 + c) = (a -b) + (a-c)and 
the right distributive law (a + b)-c = (a-c)+(-c) hold. | 


” Sections 24 and 25 are not required for the remainder of the text. 
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18.2 Example 


Rings and Fields 


We are well aware that axioms .#,,.#2,, and .#, for a ring hold in any subset of the 
complex numbers that is a group under addition and that is closed under multiplication. 


For example, (Z, +, -), (Q, +,-), (R, +, -), and (C, +, -) are rings. A 


HIstToRIcAL NOTE 


lhe theory of rings grew out of the study of two 

particular classes of rings, polynomial rings in 
n variables over the real or complex numbers (Sec- 
tion 22) and the “integers” of an algebraic number 
field. It was David Hilbert (1862-1943) who first 
introduced the term ring, in connection with the lat- 
ter example, but it was not until the second decade 
of the twentieth century that a fully abstract defi- 
nition appeared. The theory of commutative rings 
was given a firm axiomatic foundation by Emmy 
Noether (1882-1935) in her monumental paper 
“Tdeal Theory in Rings,” which appeared in 1921.A 
major concept of this paper is the ascending chain 
condition for ideals. Noether proved that in any ring 
in which every ascending chain of ideals has a max- 
imal element, every ideal is finitely generated. 

Emmy Noether received her doctorate from the 
University of Erlangen, Germany, in 1907. Hilbert 


invited her to Gottingen in 1915, but his efforts to 
secure her a paid position were blocked because 
of her sex. Hilbert complained, “I do not see that 
the sex of the candidate is an argument against her 
admission [to the faculty]. After all, we are a uni- 
versity, not a bathing establishment.” Noether was, 
however, able to lecture under Hilbert’s name. Ul- 
timately, after the political changes accompanying 
the end of the First World War reached Gottingen, 
she was given in 1923 a paid position at the Univer- 
sity. For the next decade, she was very influential 
in the development of the basic concepts of modern 
algebra. Along with other Jewish faculty members, 
however, she was forced to leave Gottingen in 1933. 
She spent the final two years ofher life at Bryn Mawr 
College near Philadelphia. 


18.3 Example 


It is customary to denote multiplication in a ring by juxtaposition, using ab in place 
of a:b. We shall also observe the usual convention that multiplication is performed 
before addition in the absence of parentheses, so the left distributive law, for example, 
becomes 


a(b+c)=ab+ac, 


without the parentheses on the right side of the equation. Also, as a convenience analogous 
to our notation in group theory, we shall somewhat incorrectly refer to a ring R in place 
of a ring (R, +, -), provided that no confusion will result. In particular, from now on Z 
will always be (Z, +, -), and Q, R, and C will also be the rings in Example 18.2. We 
may on occasion refer to (R, +) as the additive group of the ring R. 


Let R be any ring and let M,,(R) be the collection of all n x n matrices having elements 
of R as entries. The operations of addition and multiplication in R allow us to add 
and multiply matrices in the usual fashion, explained in the appendix. We can quickly 
check that (M,,(R), +) is an abelian group. The associativity of matrix multiplication 
and the two distributive laws in M,,(R) are more tedious to demonstrate, but straight- 
forward calculations indicate that they follow from the same properties in R. We will 


18.4 Example 


18.5 Example 


18.6 Example 


18.7 Example 
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assume from now on that we know that M,(R) is a ring. In particular, we have the 
rings M,,(Z), M,(Q), M,(R), and M,(C). Note that multiplication is not a commutative 
operation in any of these rings for n > 2. A 


Let F be the set of all functions f : R > R. We know that (F, +) is an abelian group 
under the usual function addition, 


(f + 8x) = f) + 8). 
We define multiplication on F by 


(fg)(x) = f(x)g(*). 


That is, fg is the function whose value at x is f(x)g(x). It is readily checked that F is a 
ring; we leave the demonstration to Exercise 34. We have used this juxtaposition notation 
ou for the composite function o(t(x)) when discussing permutation multiplication. If 
we were to use both function multiplication and function composition in F, we would 
use the notation f o g for the composite function. However, we will be using compo- 
sition of functions almost exclusively with homomorphisms, which we will denote by 
Greek letters, and the usual product defined in this example chiefly when multiplying 
polynomial function f(«)g(x), so no confusion should result. A 


Recall that in group theory, nZ is the cyclic subgroup of Z under addition consisting of 
all integer multiples of the integer n. Since (nr)(ns) = n(nrs), we see that nZ, is closed 
under multiplication. The associative and distributive laws which hold in Z then assure 
us that (nZ, +, -) is as ring. From now on in the text, we will consider nZ to be this ring. 

A 


Consider the cyclic group (Z,, +). If we define for a,b ¢ Z, the product ab as the 
remainder of the usual product of integers when divided by n, it can be shown that 
(Zn, +; +) is a ring. We shall feel free to use this fact. For example, in Zip we have 
(3)(7) = 1. This operation on Z, is multiplication modulo n. We do not check the ring 
axioms here, for they will follow in Section 26 from some of the theory we develop 


there. From now on, Z,, will always be the ring (Z,, +, -). A 
If R,, Ro, +++, Ry are rings, we can form the set Ry x Ry x --: x R, of all ordered 
n-tuples (71, 72, +++, 7), where r; € R;. Defining addition and multiplication of n-tuples 


by components (just as for groups), we see at once from the ring axioms in each compo- 
nent that the set of all these n-tuples forms a ring under addition and multiplication by 
components. The ring Rj x R, x --- x R,, is the direct product of the rings R;. A 


Continuing matters of notation, we shall always let O be the additive identity of a 
ting. The additive inverse of an element a of a ring is —a. We shall frequently have 
occasion to refer to a sum 


a+ta+:--+a4 


having n summands. We shall let this sum be n - a, always using the dot. However, n - a 
is not to be constructed as a multiplication of n and a in the ring, for the integer n may 
not be in the ring at all. fin < 0, we let 


n-a=(—a)+ (—a)++--+ (—a) 
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18.8 Theorem 


Proof 


Rings and Fields 


for |n| summands. Finally, we define 
0-a=0 

for 0 € Z on the left side of the equations and 0 € RF on the right side. Actually, the 
equation 0a = 0 holds also for 0 € R on both sides. The following theorem proves this 
and various other elementary but important facts. Note the strong use of the distributive 
laws in the proof of this theorem. Axiom .#, for a ring concerns only addition, and 
axiom .#, concerns only multiplication. This shows that in order to prove anything that 
gives a relationship between these two operations, we are going to have to use axiom 
.é,. For example, the first thing that we will show in Theorem 18.8 is that Oa = 0 for 
any element a in a ring R. Now this relation involves both addition and multiplication. 
The multiplication Oa stares us in the face, and 0 is an additive concept. Thus we will 
have to come up with an argument that uses a distributive law to prove this. 


If R is a ring with additive identity 0, then for any a, b € R we have 
1. Od =a0=0, 
2. a(—b) = (—a@)b = —(ab), 
3. (-—a)(—b) = ab. 
For Property 1, note that by axioms #, and A, 
a0+a0=a0+0)=a0=0+4 a0. 
Then by the cancellation law for the additive group (R, +), we have a0 = 0. Likewise, 
Oa + 0a = (0+ Oa = 0a = 0+ 0a 


implies that Oa = 0. This proves Property 1. 

In order to understand the proof of Property 2, we must remember that, by definition, 
—(ab) is the element that when added to ab gives 0. Thus to show that a(—b) = —(ab), 
we must show precisely that a(—b) + ab = 0. By the left distributive law, 


a(—b) + ab = a(—b + b) = a0 = 0,7 
since a0 = 0 by Property 1. Likewise, 

(-a)b+ab=(-a+a)b=0b=0. 
For Property 3, note that 


(—a)(—b) = —(a(—b)) 
by Property 2. Again by Property 2, 
—(a{—b)) = —(—(@b)), 


and —(—(ab)) is the element that when added to —(ab) gives 0. This is ab by definition 
of —(ab) and by the uniqueness of an inverse in a group. Thus, (—a)(—b) = ab. Ad 


It is important that you understand the preceding proof. The theorem allows us to 
use our usual rules for signs. 


18.9 Definition 


18.10 Example 


18.11 Example 
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Homomorphisms and Isomorphisms 


From our work in group theory, it is quite clear how a structure-relating map of aring R 
into a ring R’ should be defined. 


For rings R and R’, a map ¢ : R > R’ is a homomorphism if the following two con- 
ditions are satisfied for alla, b € R: 


1. o(a+b)=9(@)+ o), 
2. (ab) = (aol). a 


In the preceding definition, Condition 1 is the statement that ¢ is a homomor- 
phism mapping the abelian group (R, +) into (R’, +). Condition 2 requires that ¢ relate 
the multiplicative structures of the rings R and R’ in the same way. Since ¢ is also 
a group homomorphism, all the results concerning group homomorphisms are valid 
for the additive structure of the rings. In particular, ¢ is one to one if and only if its 
kernel Ker(¢) = {a € R| (a) = 0} is just the subset {0} of R. The homomorphism 
@ of the group (R, +) gives rise to a factor group. We expect that a ring homomor- 
phism will give rise to a factor ring. This is indeed the case. We delay discussion of 
this to Section 26, where the treatment will parallel our treatment of factor groups in 
Section 14. 


Let F be the ring of all functions mapping R into R defined in Example 18.4. For each 
a € R, we have the evaluation homomorphism ¢, : F — R, where ¢,(f) = f(a) for 
f ¢ F. We defined this homomorphism for the group (F, +) in Example 13.4, but we 
did not do much with it in group theory. We will be working a great deal with it in the 
rest of this text, for finding a real solution of a polynomial equation p(x) = 0 amounts 
precisely to finding a € R such that ¢,(p) = 0. Much of the remainder of this text deals 
with solving polynomial equations. We leave the demonstration of the multiplicative 
homomorphism property 2 for ¢, to Exercise 35. A 


The map ¢ : Z > Z, where (a) is the remainder of a modulo n is a ring homomor- 
phism for each positive integer n. We know ¢(a + b) = d(a) + G(4) by group theory. 
To show the multiplicative property, write a = qin +7, and b = qn + rz according 
to the division algorithm. Then ab = n(qigon + r1qg2 + qir2) + ryr2. Thus 6(ab) is the 
remainder of rjr2 when divided by n. Since O(a) =r, and O(b) = 12, Example 18.6 
indicates that ¢(a)@(b) is also this same remainder, so (ab) = ¢(a)(b). From group 
theory, we anticipate that the ring Z, might be isomorphic to a factor ring Z/nZ. This 
is indeed the case; factor rings will be discussed in Section 26. A 


We realize that in the study of any sort of mathematical structure, an idea of basic 
importance is the concept of two systems being structurally identical, that is, one being 
just like the other except for names. In algebra this concept is always called isomorphism. 
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18.12 Definition 


18.13 Example 


18.14 Definition 


18.15 Example 


Rings and Fields 


The concept of two things being just alike except for names of elements leads us, just as 
it did for groups, to the following definition. 


Anisomorphism ¢ : R > R’ from aring R to aring R’ is a homomorphism that is one 
to onc and onto R’. The rings R and R’ are then isomorphic. = 


From our work in group theory, we expect that isomorphism gives an equivalence 
relation on any collection of rings. We need to check that the multiplicative property of an 
isomorphism is satisfied for the inverse map #~! : R’ > R (to complete the symmetry 
argument). Similarly, we check that if 4 : R’ > R” is also a ring ismorphism, then the 
multiplicative requirement holds for the composite map u¢ : R > R” (to complete the 
transitivity argument). We ask you to do this in Exercise 36. 


As abelian groups, (Z,+) and (2Z, +) are isomorphic under the map @:Z— Z, 
with @(x) = 2x for x € Z. Here @ is not a ring isomorphism, for dy) = 2xy, while 
O(x)O(y) = 2x2y = 4xy. a 


Multiplicative Questions: Fields 


Many of the rings we have mentioned, such as Z, Q, and R, have a multiplicative identity 
element 1. However, 2Z does not have an identity element for multiplication. Note also 
that multiplication is not commutative in the matrix rings described in Example 18.3. 

It is evident that {0}, with 0 + 0 = 0 and (0)(0) = 0, gives a ring, the zero ring. 
Here 0 acts as multiplicative as well as additive identity element. By Theorem 18.8, 
this is the only case in which 0 could act as a multiplicative identity element, for from 
Oa = 0, we can then deduce that a = 0. Theorem 3.13 shows that if a ring has a multi- 
plicative identity element, it is unique. We denote a multiplicative identity element in a 
ring by l. 


Aring in which the multiplication is commutative is a commutative ring. A ring with a 
multiplicative identity element is a ring with unity; the multiplicative identity element 
1 is calied “unity.” a 


In a ring with unity 1 the distributive laws show that 


(414-41) (414-40 =0414--40, 
n summands m summands nm summands 


that is, (1 - 1)(m - 1) = (nm) - 1. The next example gives an application of this observa- 
tion. 


We claim that for integers r and s where gcd(r, s) = 1, the rings Z,; and Z, x Zy are 
isomorphic. Additively, they are both cyclic abelian groups of order rs with generators 
1 and (1, 1) respectively. Thus @ : Z,, > Z, x Z, defined by ¢(n - 1) =n - (1, 1) is an 
additive group isomorphism. To check the multiplicative Condition 2 of Definition 18.9, 


18.16 Definition 
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we use the observation preceding this example for the unity (1, 1) in the ring Z, x Z,, 
and compute. 


o(nm) = (am) - (0,1) = [n- (1, D][m- 0, 1D] = 6) Gn). A 


Note that a direct product R; x Ro x --- x R, of rings is commutative or has unity 
if and only if each R; is commutative or has unity, respectively. 

In a ring R with unity 1 40, the set R* of nonzero elements, if closed under 
the ring multiplication, will be a multiplicative group if multiplicative inverses exist. 
A multiplicative inverse of an element a in a ring R with unity 1 4 0 is an element 
a7! € Rsuchthat aa~! = a~!a = 1. Precisely as for groups, a multiplicative inverse for 
an element a in R is unique, if it exists at all (see Exercise 43). Theorem 18.8 shows that 
it would be hopeless to have a multiplicative inverse for 0 except for the ring {0}, where 
0+ 0 =0 and (0)(0) = 0, with 0 as both additive and multiplicative identity element. 
We are thus led to discuss the existence of multiplicative inverses for nonzero elements 
in a ring with nonzero unity. There is unavoidably a lot of terminology to be defined in 
this introductory section on rings. We are almost done. 


Let R be aring with unity 1 4 0. Anelement uv in R is a unit of R if it has a multiplicative 
inverse in R. If every nonzero element of R is a unit, then R is a division ring (or skew 
field). A field is a commutative division ring. A noncommutative division ring is called 
a “strictly skew field.” ] 


Let us find the units in Z,4. Of course, 1 and —1 = 13 are units. Since (3)(5) = 1 we 
see that 3 and 5 are units; therefore —3 = 11 and —5 = 9 are also units. None of the 
remaining elements of Z4 can be units, since no multiple of 2, 4, 6, 7, 8, or 10 can 
be one more than a multiple of 14; they all have a common factor, either 2 or 7, with 
14. Section 20 will show that the units in Z, are precisely those m € Z, such that 
gcd(m, n) = 1. A 


Z is not a field, because 2, for example, has no multiplicative inverse, so 2 is not a unit 
in Z. The only units in Z are 1 and —1. However, Q and R are fields. An example of a 
strictly skew field is given in Section 24. A 


We have the natural concepts of a subring of a ring and subfield of a field. A subring 
of a ring is a subset of the ring that is a ring under induced operations from the whole 
ring; a subfield is defined similarly for a subset of a field. In fact, let us say here once and 
for all that if we have a set, together with a certain specified type of algebraic structure 
(group, ring, field, integral domain, vector space, and so on), then any subset of this set, 
together with a natural induced algebraic structure that yields an algebraic structure of 
the same type, is a substructure. If K and L are both structures, we shalllet K < LZ denote 
that K is a substructure of L and K < L denote that K < L but K + L. Exercise 48 
gives criteria for a subset S$ of a ring R to form a subring of R. 
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Finally, be careful not to confuse our use of the words unit and unity. Unity is 
the multiplicative identity element, while a unit is any element having a multiplicative 
inverse. Thus the multiplicative identity element or unity is a unit, but not every unit is 
unity. For example, ~1 is a unit in Z, but —1 is not unity, that is, —1 4 1. 


ooo 


@ HIsTorIcAL NOTE 


Ithough fields were implict in the early work 

on the solvability of equations by Abel and 
Galois, it was Leopold Kronecker (1823-1891) 
who in connection with his own work on this subject 
first published in 1881 a definition of what he called 
a “domain of rationality”: “The domain of rational- 
ity (R’, R”, R, ---) contains - - - every one of those 
quantities which are rational functions of the quan- 
tities R’, R’, R”,.-- with integral coefficients.” 
Kronecker, however, who insisted that any math- 
ematical subject must be constructible in finitely 
many steps, did not view the domain of rationality 
as acomplete entity, but merely as a region in which 
took place various operations on its elements. 

Richard Dedekind (1831-1916), the inventor 
of the Dedekind cut definition of a real number, 
considered a field as a completed entity. In 1871, 


@ EXERCISES 18 


Computations 


he published the following definition in his supple- 
ment to the second edition of Dirichlet’s text on 
number theory: “By a field we mean any system of 
infinitely many real or complex numbers, which in 
itself is so closed and complete, that the addition, 
subtraction, multiplication, and division of any two 
numbers always produces a number of the same sys- 
tem.” Both Kronecker and Dedekind had, however, 
dealt with their varying ideas of this notion as early 
as the 1850s in their university lectures. 

A more abstract definition of a field, similar 
to the one in the text, was given by Heinrich Weber 
(1842-1913) in a paper of 1893. Weber’s definition, 
unlike that of Dedekind, specifically included fields 
with finitely many elements as well as other fields, 
such as function fields, which were not subfields of 
the field of complex numbers. 


In Exercises 1 through 6, compute the product in the given ring. 


1. (12)(16) in Zog 
3. (11)(—4) in Zis 
5. (2,3)(3,5) in Zs x Zo 


In Exercises 7 through 13, decide whether the indicated operations of addition and multiplication are defined 
(closed) on the set, and give a ring structure. If a ring is not formed, tell why this is the case. If a ring is formed, 


2. (16)(3) in Za. 
4, (20)(—8) in Zo6 
6. (-3,5)(2,-4) in Lia x Z\\ 


state whether the ring is commutative, whether it has unity, and whether it is a field. 


7. nZ with the usual addition and multiplication 


8. Z* with the usual addition and multiplication 


9. Z x Z with addition and multiplication by components 


10. 2Z x Z with addition and multiplication by components 
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11. fa + bV/2|a, b € Z} with the usual addition and multiplication 

12. {a+ bJ/2 |a, b € Q} with the usual addition and multiplication 

13. The set of all pure imaginary complex numbers ri for r € IR with the usual addition and multiplication 
In Exercises 14 through 19, describe all units in the given ring 


14, Z 15.Z2xZ 16. Zs 

17. Q 18. Z2xQxZ 19, Zs 

20. Consider the matrix ring M>(Z2). 
a. Find the order of the ring, that is, the number of elements in it. 
b. List all units in the ring. 

21. If possible, give an cxample of a homomorphism ¢ : R — R' where R and R’ are rings with unity 1 4 0 and 
1’ £0’, and where 6(1) 4 0’ and (1) 4 I’. 


22. (Linear algebra) Consider the map det of M,,(R) into R where det(A) is the determinant of the matrix A for 
A € M,,(R). Is det a ring homomorphism? Why or why not? 


23. Describe all ring homomorphisms of Z into Z. 
24, Describe all ring homomorphisms of Z into Z x Z. 
25. Describe all ring homomorphisms of Z x Z into Z. 
26. How many homomorphisms are there of Z x Z x Z into Z? 
27. Consider this solution of the equation X° = J; in the ring M3(R). 
V=ak implies X* — Iz = 0, the zero matrix, so factoring, we have (X — 4)(X + 4) =0 
whence either X = J; or X = —J3. 
Is this reasoning correct? If not, point out the error, and if possible, give a counterexample to the conclusion. 


28. Find all solutions of the equation x? + x — 6 = Ointhe ring Z;,4 by factoring the quadratic polynomial. Compare 
with Exercise 27. 


Concepts 


In Exercises 29 and 30, correct the definition of the italicized term without reference to the text, if correction is 
needed, so that it is in a from acceptable for publication. 


29. A field F is aring with nonzero unity such that the set of nonzero elements of F is a group under multiplication. 
30. A unit in a ring is an element of magnitude 1. 
31, Give an example of a ring having two elements a and b such that ab = O but neither a nor b is zero. 


32, Give an example of a ring with unity 1 4 0 that has a subring with nonzero unity 1’ 4 1. [Hint: Consider a 
direct product, or a subring of Z,.] 


33, Mark each of the following true or false. 


a. Every field is also a ring. 
b. Every ring has a multiplicative identity. 


c. Every ring with unity has at least two units. 


d. Every ring with unity has at most two units. 


DR ————— zz E 
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____ e, It is possible for a subset of some field to be aring but not a subfield, under the induced operations. 
______ f The distributive laws for a ring are not very important. 

_____ g. Multiplication in a field is commutative. 

h. The nonzero elements of a field form a group under the multiplication in the field. 


i. Addition in every ring is commutative. 
j. Every element in a ring has an additive inverse. 


Theory 


34. 


35. 
36. 


37. 


38. 
39. 
40. 
41 


42. 


43. 
44, 


45. 


46. 


47, 
48. 


49. 


50. 


Show that the multiplication defined on the set F of functions in Example 18.4 satisfies axioms KR, and Fe, 
for a ring. 
Show that the evaluation map ¢, of Example 18.10 satisfies the multiplicative requirement for ahomomorphism. 


Complete the argument outlined after Definitions 18.12 to show that isomorphism gives an equivalence relation 
on a collection of rings. 


Show that if U is the collection of all units in a ring (R, +, -) with unity, then (U, -) is a group. [Warning: Be 
sure to show that U is closed under multiplication.] 


Show that a2 — b? = (a + b\a — b) for alla and b inating R if and only if R is commutative. 
Let (R, +) be an abelian group. Show that (R, +, -) is a ring if we define ab = 0 for alla, be R. 
Show that the rings 2Z and 3Z are not isomorphic. Show that the fields R and C are not isomorphic. 


(Freshman exponentiation) Let p be a prime. Show that in the ring Z, we have (a + b)? = a? +b? for all 
a,b € Zp. [Hint: Observe that the usual binomial expansion for (a + b)" is valid in a commutative ring.] 


Show that the unity element ina subfield of a field must be the unity of the whole field, in contrast to Exercise 32 
for rings. 

Show that the multiplicative inverse of a unit a ring with unity is unique. 

An element a of ating R is idempotent if a=a. 

a. Show that the set of all idempotent elements of a commutative ring is closed under multiplication. 

p. Find all idempotents in the ring Ze xX Zi. 


(Linear algebra) Recall that for an m x n matrix A, the transpose AT of A is the matrix whose jth column 
is the jth row of A. Show that if A is an m x n matrix such that ATA is invertible, then the projection matrix 
P = A(A‘A)"'A? is an idempotent in the ring of n x a matrices. 


Anelement a of aring R is nilpotent if a? — Ofor some n € Z*. Show thatifa and b are nilpotent elements 
of a commutative ring, then a + b is also nilpotent. 


Show that a ring R has no nonzero nilpotent element if and orily if 0 is the only solution of x7 = Oin R. 


Show that a subset S of aring R gives a subring of R if and only if the following hold: 


OES; 
(a—b)€ Sforalla,be S$; 
ab ¢ Sforalla,be S. 


a. Show that an intersection of subrings of aring R is again a subring of R. 
b. Show that an intersection of subfields of a field F is again a subfield of F. 


Let R be aring, and let a bea fixed element of R. Let I, = {x € Rl ax = 0}. Show that J, is a subring of R. 


51. 


52. 


53 


54. 


55, 


56. 
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Let R be a ring, and let a be a fixed element of R. Let R, be the subring of R that is the intersection of all 
subrings of R containing a (see Exercise 49). The ring R, is the subring of R generated by a. Show that the 
abelian group (Rg, +) is generated (in the sense of Section 7) by {a” |n € Z}. 


(Chinese Remainder Theorem for two congruences) Let r and s be positive integers such that gcd(r, s) = 1. 
Use the isomorphism in Example 18.15 to show that for m,n € Z, there exists an integer x such that x =m 
(mod r) and x =n (mod s). 


a. State and prove the generalization of Example 18.15 for a direct product with n factors. 


b. Prove the Chinese Remainder Theorem: Let a;,b; € Z* for i = 1,2,---,m and let ged(b;,b;) = 1 for 
i ~ j. Then there exists x ¢ Z* such that x = a; (mod b;) fori = 1,2,---,n. 


Consider (S$, +, -), where S is a set and + and - are binary operations on S such that 
(5, +) is a group, 
(S*, +) is a group where S* consists of all elements of S except the additive identity element, 
a(b +c) = (ab) + (ac) and (a + b)c = (ac) + (bc) for alla, b,c € S. 


Show that (§, +, -) is a division ring. [Hint: Apply the distributive Jaws to (1 + 1)(a + 5) to prove the commu- 
tativity of addition.] 


A ting R is a Boolean ring if a? =a for all a € R, so that every element is idempotent. Show that every 
Boolean ring is commutative. 


(For students having some knowledge of the laws of set theory) For a set S, let AS) be the collection of all 
subsets of S. Let binary operations + and - on H(S) be defined by 


A+B=(AUB)-(ANB)={x]x eAorx € Bbutx g(ANB)} 
and 
A-B=ANB 
for A, B € F(S). 


a. Give the tables for + and - for A(S), where S = {a, b}. [Hint: A(S) has four elements.] 
b. Show that for any set S, (7(S), +, -) is a Boolean ring (see Exercise 55). 


INTEGRAL DOMAINS 

While a careful treatment of polynomials is not given until Section 22, for purposes of 
motivation we shall make intuitive use of them in this section. 

Divisors of Zero and Cancellation 


One of the most important algebraic properties of our usual number system is that a 
product of two numbers can only be 0 if at least one of the factors is 0. We have used 
this fact many times in solving equations, perhaps without realizing that we were using 
it. Suppose, for example, we are asked to solve the equation 


x7 —5x+6=0. 
The first thing we do is to factor the left side: 


x? — 5x +6 = (x —2)(x — 3). 
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19.1 Example 


Solution 


19.2 Definition 


19.3 Theorem 


Proof 


19.4 Corollary 


Proof 
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Then we conclude that the only possible values for x are 2 and 3. Why? The reason is that 
if x is replaced by any number a, the product (a — 2)(a — 3) of the resulting numbers 
is O if and only if either a -2 =O ora—3=0. 


Solve the equation x? — 5x +6=0inZy. 


The factorization x2 — 5x + 6 = (x — 2)(x — 3) is still valid if we think of x as standing 
for any number in Zy2. But in Z12, not only is Oa = a0 = 0 for alla € Zyp, but also 
(2)(6) = (6)(2) = 3)(4) = (43) = (38) = (8)G) 
= (4)(6) = (6)(4) = (49) = O)(A4) = (6) = (6)(8) 
= (8)(6) = (6)(10) = (10)(6) = (8)(9) = (8) = 0. 


We find, in fact, that our equation has not only 2 and 3 as solutions, but also 6 and 11, 
for (6 — 2)(6 — 3) = (4)(3) = 0 and (11 — 2)(11 — 3) = (9)(8) = Oin Zyp. A 


These ideas are of such importance that we formalize them in a definition. 


If a and b are two nonzero elements of a ring R such that ab = 0, then a and b are 
divisors of 0 (or 0 divisors). | 


Example 19.1 shows that in Z2 the elements 2, 3, 4, 6, 8, 9, and 10 are divisors 
of 0. Note that these are exactly the numbers in Z2 that are not relatively prime to 12, 
that is, whose ged with 12 is not 1. Our next theorem shows that this is an example of a 
general situation. 


In the ring Z,,, the divisors of 0 are precisely those nonzero elements that are not rela- 
tively prime to n. 


Let m € Z,, where m + 0, and let the gcd of m andn be d #1. Then 


“5)=(3) 


and (m/d)n gives 0 as a multiple of n. Thus m(n/d) = Oin Z,, while neither m nor n/d 
is 0, so m is a divisor of 0. 

On the other hand, suppose m € Z, is relatively prime to n. If for s € Z, we have 
ms = 0, then n divides the product ms of m and s as elements in the ring Z. Since n is 
relatively prime to m, boxed Property 1 following Example 6.9 shows that n divides s, 
sos = O0in Z,. ¢ 


If p is a prime, then Z, has no divisors of 0. 
This corollary is immediate from Theorem 19.3. 4 


Another indication of the importance of the concept of 0 divisors is shown in the 
following theorem. Let R be a ring, and let a, b,c ¢ R. The cancellation laws hold in 
R if ab = ac with a £ 0 implies b = c, and ba = ca with a # 0 implies b = c. These 


19.5 Theorem 


Proof 


19.6 Definition 


19.7 Example 
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are multiplicative cancellation laws. Of course, the additive cancellation laws hold in R, 
since (R, +) is a group. 


The cancellation laws hold in a ring R if and only if R has no divisors of 0. 


Let R be a ring in which the cancellation laws hold, and suppose ab = 0 for some 
a, b € R. We must show that either a or b is 0. If a 4 0, thenab = a0 implies that b = 0 
by cancellation laws. Similarly, b 4 0 implies that a = 0, so there can be no divisors of 
O if the cancellation laws hold. 

Conversely, suppose that R has no divisors of 0, and suppose that ab = ac with 
a # 0. Then 


ab—ac=a(b—c)=0. 


Since a ¢ 0, and since R has no divisors of 0, we must have b —c = 0,80 b =c. 
A similar argument shows that ba = ca with a 4 0 implies b = c. a 


Suppose that R is aring with no divisors of 0. Then an equation ax = b, witha # 0, 
in R can have at most one solution x in R, for ifax; = b and ax = b, thenax, = ax, 
and by Theorem 19.5 x1 = x2, since R has no divisors of 0. If R has unity 1 # 0 and a is 
a unit in R with multiplicative inverse a~', then the solution x of ax = bis a~'b. In the 
case that R is commutative, in particular if R is a field, it is customary to denote a'b 
and ba~' (they are equal by commutativity) by the formal quotient b/a. This quotient 
notation must not be used in the event that R is not commutative, for then we do not 
know whether b/a denotes a~'b or ba7!. In particular, the multiplicative inverse a of 
a nonzero element a of a field may be written I/a. 


Integral Domains 


The integers are really our most familiar number system. In terms of the algebraic 
properties we are discussing, Z is a commutative ring with unity and no divisors of 0. 
Surely this is responsible for the name that the next definition gives to such a structure. 


Anintegral domain D is a commutative ring with unity 1 ~ 0 and containing no divisors 
of 0. = 


Thus, if the coefficients of a polynomial are from an integral domain, one can solve 
a polynomial equation in which the polynomial can be factored into linear factors in the 
usual fashion by setting each factor equal to 0. 

In our hierarchy of algebraic structures, an integral domain belongs between a 
commutative ring with unity and a field, as we shall show. Theorem 19.5 shows that the 
cancellation laws for multiplication hold in an integral domain. 


We have seen that Z and Z, for any prime p are integral domains, but Z, is not an integral 
domain if n is not prime. A moment of thought shows that the direct product R x S of 
two nonzero rings R and S is not an integral domain. Just observe that for r ¢ R and 
s € S both nonzero, we have (r, 0)(0, 5) = (0, 0). A 
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19.8 Example 


Solution 


19.9 Theorem 


Proof 
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Show that although Zy is an integral domain, the matrix ring M2 (Z2) has divisors of zero. 
1 0\f0 0\_ {fo O 
0 O/\1 Of \O OF A 


Our next theorem shows that the structure of a field is still the most restrictive (that 
is, the richest) one we have defined. 


We need only observe that 


Every field F is an integral domain. 


Let a,b € F, and suppose that a # 0. Then if ab = 0, we have 


()an= (oa 
+= (ane [pp 


We have shown that ab = 0 with a ¢ 0 implies that b = Oin F, so there are no divisors 
of Oin F. Of course, F is a commutative ring with unity, so our theorem is proved. 


But then 


Figure 19.10 gives a Venn diagram view of containment for the algebraic structures 
having two binary operations with which we will be chiefly concerned. In Exercise 20 
we ask you to redraw this figure to include strictly skew fields as well. 

Thus far the only fields we know are Q, R, and C. The corollary of the next theorem 
will exhibit some fields of finite order! The proof of this theorem is a personal favorite. 
It is done by counting. Counting is one of the most powerful techniques in mathematics. 


Commutative 


rings 


Domains 


19.10 Figure A collection of rings. 


19.11 Theorem 
Proof 


19.12 Corollary 


Proof 


19.13 Definition 


19.14 Example 
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Every finite integral domain is a field. 
Let 
0, 1,41,°°+,an 


be all the elements of a finite domain D. We need to show that fora € D, where a 4 0, 
there exists b € D such that ab = 1. Now consider 


al, day, +++, Ady. 


We claim that all these elements of D are distinct, for aa; = aa; implies that a; = a;, by 
the cancellation laws that hold in an integral domain. Also, since D has no 0 divisors, none 


of these elements is 0. Hence by counting, we find that a1, aa,,---, aa, are elements 
1, a), +++, dG, in some order, so that either a1 = 1, that is, a = 1, or aa; = 1 for some i. 
Thus a has a multiplicative inverse. ¢ 


If p is a prime, then Z, is a field. 


This corollary follows immediately from the fact that Z, is an integral domain and from 
Theorem 19.11. a4 


The preceding corollary shows that when we consider the ring M,(Z,), we are 
talking about a ring of matrices over a field. In the typical undergraduate linear algebra 
course, only the field properties of the real or complex numbers are used in much of the 
work. Such notions as matrix reduction to solve linear systems, determinants, Cramer’s 
tule, eigenvalues and eigenvectors, and similarity transformations to try to diagonalize 
a matrix are valid using matrices over any field; they depend only on the arithmetic 
properties of a field. Considerations of linear algebra involving notions of magnitude, 
suchas least-squares approximate solutions or orthonormal bases, only make sense using 
fields where we have an idea of magnitude. The relation 


p-l=i+i4---41=0 
p summands 


indicates that there can be no very natural notion of magnitude in the field Zy. 


The Characteristic of a Ring 


Let R be any ring. We might ask whether there is a positive integer n such thatn -a =0 
for all a © R, where n-a means a+a+---+a for m summands, as explained in 
Section 18. For example, the integer m has this property for the ring Z,. 


If for aring R a positive integer n exists such that n -a = 0 for alla € R, then the least 
such positive integer is the characteristic of the ring R. If no such positive integer 
exists, then R is of characteristic 0. a 


We shall be using the concept of a characteristic chiefly for fields. Exercise 29 asks 
us to show that the characteristic of an integral domain is either 0 or a prime p. 


The ring Z,, is of characteristic n, while Z, Q, IR, and C all have characteristic 0. A 


© 
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19.15 Theorem 


Proof 
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At first glance, determination of the characteristic of a ring seems to be a tough job, 
unless the ring is obviously of characteristic 0. Do we have to examine every element a 
of the ring in accordance with Definition 19.13? Our final theorem of this section shows 
that if the ring has unity, it suffices to examine only a = |. 


Let R be ating with unity. Ifn-17 0 for all n € Z*, then R has characteristic 0. If 
n-1=0 for somen € Z*, then the smallest such integer n is the characteristic of R. 


Ifn-1#40foralln ¢ Zt, then surely we cannot have n-a = 0 for alla € R for some 
positive integer 7, so by Definition 19.13, R has characteristic 0. 
Suppose that n is a positive integer such that n - 1 = 0. Then for any a € R, we have 


neacatate-tasadtlte-+D=an-1)=a0=0. 


Our theorem follows directly. Sd 


= EXERCISES 19 


Computations 


1. Find all solutions of the equation x3 — 2x? —3x = Oin Zp. 
2. Solve the equation 3x = 2 in the field Zz; in the field Zo3. 
3, Find all solutions of the equation x2 4+2x +2 =0in Ze. 

4. Find all solutions of x? + 2x +4 =0 in Ze. 


In Exercises 5 through 10, find the characteristic of the given ring. 


5, 2Z 
8. Zs x Z3 


6. Z2xZ 7. Z3 x 32 
9, Z3 x Za 10. Ze x Zis 


11. Let R be a commutative ring with unity of characteristic 4. Compute and simplify (@ + b)* fora, b € R. 


12. Let R be a commutative ring with unity of characteristic 3. Compute and simplify (a + b)? for a,b € R. 


13. Let R be a commutative ring with unity of characteristic 3. Compute and simplify (a + by) fora,b eR. 


14, Show that the matrix E i| is a divisor of zero in M2(Z). 


Concepts 


In Exercises 15 and 16, correct the definition of the italicized term without reference to the text, if correction is 
needed, so that it is in a form acceptable for publication. 


15. If ab = 0, then a and b are divisors of zero. 


16. Ifn-a = 0 for all elements a in aring R, then n is the characteristic of R. 


17. Mark cach of the following true or false. 


a. nZ has zero divisors if n is not prime. 
b. Every field is an integral domain. 
c. The characteristic of nZ is n. 


18. 


19. 


20. 
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d. As aring, Z is isomorphic to nZ for alln > 1. 

______ e, The cancellation law holds in any ring that is isomorphic to an integral domain. 
f. Every integral domain of characteristic 0 is infinite. 

_____. g. The direct product of two integral domains is again an integral domain. 


______h. A divisor of zero in a commutative ring with unity can have no multiplicative inverse. 

i. nZ is a subdomain of Z. 

j. Zis a subfield of Q. 

Each of the six numbered regions in Fig. 19.10 corresponds to a certain type of a ring. Give an example of a 


ring in each of the six cells. For example, a ring in the region numbered 3 must be commutative (it is inside 
the commutative circle), have unity, but not be an integral domain. 


(For students who have had a semester of linear algebra) Let F be a field. Give five different characterizations 
of the elements A of M,(F) that are divisors of 0. 


Redraw Fig. 19.10 to include a subset corresponding to strictly skew fields. 


Proof Synopsis 


21. 


Give a one-sentence synopsis of the proof of the “if” part of Theorem 19.5. 


22. Give a one-sentence synopsis of the proof of Theorem 19.11. 
Theory 
23. An element a of a ring R is idempotent if a* = a. Show that a division ring contains exactly two idempotent 


24, 
25. 


26. 


27, 
28. 


29. 


30. 


elements. 
Show that an intersection of subdomains of an integral domain D is again a subdomain of D. 


Show that a finite ring R with unity 1 4 0 and no divisors of O is a division ring. (It is actually a field, although 
commutativity is not easy to prove. See Theorem 24.10.) [Note: In your proof, to show that a # 0 is a unit, 
you must show that a “Icft multiplicative inverse” of a # 0 in R is also a “right multiplicative inverse.”] 


Let R be a ring that contains at least two elements. Suppose for each nonzero a € R, there exists a unique 
b € R such that aba = a. 


a. Show that R has no divisors of 0. 

b. Show that bab = b. 

c. Show that & has unity. 

d. Show that R is a division ring. 

Show that the characteristic of a subdomain of an integral domain D is equal to the characteristic of D. 


Show that if D is an integral domain, then {n -1{n € Z} is a subdomain of D contained in every subdomain 
of D. 


Show that the characteristic of an integral domain D must be either 0 or a prime p. [Hint. If the characteristic 
of Dis mn, consider (m - 1)(n- 1) in D.] 


This exercise shows that every ring R can be enlarged (if necessary) to a ring S with unity, having the same 
characteristic as R. Let § = R x Z if R has characteristic 0, and R x Z, if R has characteristic n. Let addition 
in S be the usual addition by components, and Jet multiplication be defined by 


(r1, M1 )(ro, No) = (rire $y - 2 + 2°-71,11N2) 


where n -r has the meaning explained in Section 18. 
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a. Show that S is a ring. 

b. Show that S has unity. 

c. Show that 5 and R have the same characteristic. 

d. Show that the map ¢ : R > S given by f(r) = (7, 0) forr € R maps R isomorphically onto a subring of S. 


20.1 Theorem 


20.2 Corollary 


FERMAT’S AND EULER’S THEOREMS 
Fermat’s Theorem 


We know that as additive groups, Z, and Z/nZ are naturally isomorphic, with the coset 
a +nZ corresponding to a for each a € Z,. Furthermore, addition of cosets in Z/nZ 
may be performed by choosing any representatives, adding them in Z, and finding the 
coset of nZ containing their sum. It is easy to see that Z/nZ can be made into a ring by 
multiplying cosets in the same fashion, thatis, by multiplying any chosen representatives. 
While we will be showing this later in a more general situation, we do this special case 
now. We need only show that such coset multiplication is well defined, because the 
associativity of multiplication and the distributive laws will follow immediately from 
those properties of the chosen representatives in Z. To this end, choose representatives 
a-+yrn and b + sn, rather than a and b, from the cosets a + nZ and b + nZ. Then 


(a+rn\(b+sn)=ab4+(as+rb+rsn)n, 


which is also an element of ab + nZ. Thus the multiplication is well-defined, and our 
cosets form a ring isomorphic to the ring Z,. 
The following is a special case of Exercise 37 in Section 18. 


For any field, the nonzero elements form a group under the field multiplication. 


In particular, for Z,, the elements 
1,2,3,---,p—1 


form a group of order p — 1 under multiplication modulo p. Since the order of any 
element in a group divides the order of the group, we see that for b 4 O and b € Z,, we 
have b?—! = 1 in Z,. Using the fact that Z, is isomorphic to the ring of cosets of the 
forma + pZ described above, we see at once that for any a € Znotin the coset 0+ pZ, 
we must have 

a’! =1 (mod p). 


This gives us at once the so-called Little Theorem of Fermat. 


(Little Theorem of Fermat) Ifa € Z and p is a prime not dividing a, then p divides 
a?—! — |, thatis, a@?—' = 1 (mod p) for a # 0 (mod p). 


Ifa € Z, then a? = a (mod p) for any prime p. 


Section 20 Fermat’s and Euler’s Theorems 
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Proof The corollary follows from Theorem 20.1 if a # 0 (mod p). Ifa = 0 (mod p), then both 


sides reduce to 0 modulo p. 


@ HisToRICAL NOTE 


lhe statement of Theorem 20.1 occurs in a letter 

from Pierre de Fermat (1601-1665) to Bernard 
Frenicle de Bessy, dated 18 October 1640. Fermat’s 
version of the theorem was that for any prime p and 
any geometric progression a, a”, ---,a',---, there 
is a least number a! of the progression such that p 
divides a? — 1. Furthermore, T divides p — 1 and 
p also divides all numbers of the form a*? — 1. 
(It is curious that Fermat failed to note the condi- 
tion that p not divide a; perhaps he felt that it was 
obvious that the result fails in that case.) 

Fermat did not in the letter or elsewhere indi- 
cate a proof of the result and, in fact, never men- 
tioned it again. But we can infer from other parts of 


this correspondence that Fermat’s interest in this 
result came from his study of perfect numbers. 
(A perfect number is a positive integer m that is 
the sum of all of its divisors less than m; for ex- 
ample, 6 = 1 + 2 +3 is a perfect number.) Euclid 
had shown that 2"—-!(2" — 1) is perfect if 2” — 1 is 
prime. The question then was to find methods for de- 
termining whether 2” — 1 was prime. Fermat noted 
that 2” — 1 was composite if n is composite, and 
then derived from his theorem the result that if n 
is prime, the only possible divisors of 2” — 1 are 
those of the form 2kn + 1. From this result he was 
able quickly to show, for example, that 277 — 1 was 
divisible by 223 = 2-3-3741. 


* 


9103 = (879887) = (18)(87) = 3 = (5)! 
= (25)°(—5) = (—1)(—5) = 5 (mod 13). 


glL213 -l= pated 7 27] -~l= ppt2t ‘ 27] —] 
= 2?-1=8-1=7 (mod 11). 


Let us compute the remainder of 8!°? when divided by 13. Using Fermat’s theorem, we 


Thus the remainder of 2''!:2!3 — 1 when divided by 11 is 7, not 0. (The number 11,213 
is prime, and it has been shown that 2!1-2!3 — 1 is a prime number. Primes of the form 
2? — 1 where p is prime are known as Mersenne primes.) A 


20.3 Example 
have 
20.4 Example Show that 2!!7!5 — 1 is not divisible by 11. 
Solution By Fermat’s theorem, 2!° = 1 (mod 11), so 
20.5 Example Show that for every integer n, the number n** — nis divisible by 15. 
Solution 


This seems like an incredible result. It means that 15 divides 233 — 2, 3° — 3, 4% — 4, 
etc, 

Now 15 = 3-5, and we shall use Fermat's theorem to show that n** — n is divisible 
by both 3 and 5 for every n. Note that n**? —n = n(n — 1). 
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20.6 Theorem 


Proof 
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If 3 divides n, then surely 3 divides n(n*? — 1). If 3 does not divide n, then by 
Fermat’s theorem, n* = 1 (mod 3) so 


n= —1 = (n?)!© —1 =1'6 — 1 =0 (mod 3), 


and hence 3 divides n>? — 1. 
If n = 0 (mod 5), then n** — n = 0 (mod 5). If n #0 (mod 5), then by Fermat's 
theorem, n* = 1 (mod 5), so 


n@ —1 =(n*)8 —1 = 18-1 =0(mod5). 


Thus n°3 — n = 0 (mod 5) for every n also. + 


Euler’s Generalization 


Euler gave a generalization of Fermat’s theorem. His generalization will follow at once 
from our next theorem, which is proved by counting, using essentially the same argument 
as in Theorem 19.11. 


The set G, of nonzero elements of Z, that are not 0 divisors forms a group under 
multiplication modulo n. 


First we must show that G, is closed under multiplication modulo n. Let a, b € Gy. If 
ab € G,, then there would exist c # 0 in Z, such that (ab)c = 0. Now (ab)c = O implies 
that a(bc) = 0. Since b € G, and c 4 0, we have be # 0 by definition of G,. But then 
a(bc) = O would imply that a ¢ G, contrary to assumption. Note that we have shown that 
for any ring the set of elements that are not divisors of 0 is closed under multiplication. 
No structure of Z, other than ring structure has been involved so far. 

We now show that G, is a group. Of course, multiplication modulo a is associative, 
and 1 € G,. It remains to show that for a € G,, there is b € G, such that ab = 1. Let 


1, Ay,°**, ay 
be the elements of G,. The elements 
al, ada,--+,da, 


are all different, for if aa; = aa;, then a(a; — aj) = 0, and since a € G, and thus is not 
a divisor of 0, we must have a; — aj = 0 or a; = aj. Therefore by counting, we find that 
either al = 1, or some aa; must be 1, so a has a multiplicative inverse. Sd 


Note that the only property of Z, used in this last theorem, other than the fact that 
it was a ring with unity, was that it was finite. In both Theorem 19.11 and Theorem 20.6 
we have (in essentially the same construction) employed a counting argument. Counting 
arguments are often simple, but they are among the most powerful tools of mathematics. 

Let n be a positive integer. Let y(n) be defined as the number of positive integers 
less than or equal to n and relatively prime to n. Note that o(1) = 1. 


Let n = 12. The positive integers less than or equal to 12 and relatively prime to 12 are 
1, 5, 7, and 11, so g(12) = 4. A 


20.8 Theorem 


Proof 


20.9 Example 


20.10 Theorem 


Proof 


20.11 Corollary 


20.12 Theorem 
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By Theorem 19.3, g(m) is the number of nonzero elements of Z, that are not 
divisors of 0. This function g : Z* — Z* is the Euler phi-function. We can now de- 
scribe Euler’s generalization of Fermat’s theorem. 


(Euler’s Theorem) [If a is an integer relatively prime to n, then a?” — 1 is divisible 
by n, that is, a? = 1 (mod n). 


If a is relatively prime to n, then the coset a + nZ of nZ containing a contains an integer 
b <n and relatively prime to n. Using the fact that multiplication of these cosets by 
multiplication modulo x of representatives is well-defined, we have 


a? = b? (mod n). 


But by Theorems 19.3 and 20.6, b can be viewed as an element of the multiplicative 
group G, of order y(n) consisting of the y(n) elements of Z,, relatively prime to n. Thus 


b?™ = 1 (modn), 
and our theorem follows. . 


Let 2 = 12. We saw in Example 20.7 that gy(12) = 4. Thus if we take any integer a 
relatively prime to 12, then a* = 1 (mod 12). For example, with a = 7, we have P= 
(497 = 2, 401 = 12(200) + 1, so 7* = 1 (mod 12). Of course, the easy way to compute 
7* (mod 12), without using Euler’s theorem, is to compute it in Zj2. In Z12, we have 
7=—5s0 


P=a(-SP=6=1 and FPP. Z 


Application to ax = b (mod m) 

Using Theorem 20.6, we can find all solutions of a linear congruence ax = b (mod m). 
We prefer to work with an equation in Z,, and interpret the results for congruences. 
Let m be a positive integer and let a € Z,, be relatively prime to m. For each b € Z,, 
the equation ax = b has a unique solution in Z,. 


By Theorem 20.6, a is a unit in Z, and s = a~'b is certainly a solution of the equation. 
Multiplying both sides of ax = b on the left by a7!, we see this is the only solution. 
5 


Interpreting this theorem for congruences, we obtain at once the following corollary. 


If a and m are relatively prime intergers, then for any integer b, the congruence ax = 
b (mod m) has as solutions all integers in precisely one residue class modulo m. 


Theorem 20.10 serves as a lemma for the general case. 
Let m be a positive integer and let a, b € Z,,. Let d be the gcd of a and m. The equation 


ax = b has a solution in Z,, if and only if d divides b. When d divides b, the equation 
has exactly d solutions in Z,. 
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20.14 Example 


Solution 


20.15 Example 


Solution 
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First we show there is no solution of ax = b in Z,, unless d divides b. Suppose s € Z» is 
a solution. Then as — b = gm in Z, so b = as — qm. Since d divides both a and m, we 
see that d divides the right-hand side of the equation b = as — qm, and hence divides 
b. Thus a solution s can exist only if d divides b. 

Suppose now that d does divide b. Let 


a=ad, b=b,d, and m=myd. 


Then the equation as — b = gm in Z can be rewritten as d(a;s — by) = dgqm,. We see 
that as — bisa multiple of m if and only ifa,s — b; isa multiple of m,. Thus the solutions 
s of ax = b in Z,, are precisely the elements that, read modulo my, yield solutions of 
ax = b, in Z,,. Now lets € Z,, be the unique solution of ajx = b, in Zm, given by 
Theorem 20.10, The numbers in Z,, that reduce to s modulo m are precisely those that 
can be computed in Z,, as 


s,s +m,,s+2m,5+3m,,---,s+(d—-— Dm. 


Thus there are exactly d solutions of the equation in Z,,. Sd 


Theorem 20.12 gives us at once this classical result on the solutions of a linear 
congruence. 


Let d be the gcd of positive integers a and m. The congruence ax = b (mod m) has a 
solution if and only if d divides b. When this is the case, the solutions are the integers in 
exactly d distinct residue classes modulo m. 


Actually, our proof of Theorem 20.12 shows a bit more about the solutions of ax = b 
(mod m1) than we stated in this corollary; namely, it shows that if any solution s is found, 
then the solutions are precisely all elements of the residue classes (s + km) + (mZ) 
where m, = m/d and k runs through the integers from 0 tod — 1. It also tells us that we 
can find such an s by finding a, = a/d and b) = b/d, and solving a,x = by (mod m)). 
To solve this congruence, we may consider a and b, to be replaced by their remainders 
modulo m, and solve the equation a;x = b, in Zm,. 


Find all solutions of the congruence 12x = 27 (mod 18). 


The gcd of 12 and 18 is 6, and 6 is not a divisor of 27. Thus by the preceding corollary, 
there are no solutions. A 


Find all solutions of the congruence 15x = 27 (mod 18). 


The ged of 15 and 18 is 3, and 3 does divide 27. Proceeding as explained before Ex- 
ample 20.14, we divide everything by 3 and consider the congruence 5x = 9 (mod 6), 
which amounts to solving the equation 5x = 3 in Ze. Now the units in Ze are 1 and 
5, and 5 is clearly its own inverse in this group of units. Thus the solution in Z¢ is 
x = (571)(3) = (5)(3) = 3. Consequently, the solutions of 15x = 27 (mod 18) are the 
integers in the three residue classes. 

3+ 18Z = {---, —33, —15, 3, 21, 39, +--+}, 

94+ 18Z ={---,-—27, —9, 9,27, 45, ---}. 


15 + 18Z = {---, —21, —3, 15, 33, 51, +++}, 
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illustrating Corollary 20.13. Note the d = 3 solutions 3, 9, and 15 in Zig. All the solutions 
in the three displayed residue classes modulo 18 can be collected in the one residue class 
3 + 6Z modulo 6, for they came from the solution x = 3 of 5x = 3 in Ze. A 
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Computations 


We will see later that the multiplicative group of nonzero elements of a finite field is cyclic. Illustrate this by finding 
a generator for this group for the given finite field. 


1. Z 2. Zi; 3. Zi7 
4, Using Fermat’s theorem, find the remainder of 3*7 when it is divided by 23. 

5. Use Fermat’s theorem to find the remainder of 37 when it is divided by 7. 
6. 


. Compute the remainder of 22”) + 1 when divided by 19. [Hint: You will need to compute the remainder of 
2!7 modulo 18.] 


. Make a table of values of g() forn < 30. 


~~ 


8. Compute v(p?) where p is a prime. 
9. Compute y(pq) where both p and gq are primes. 
10 


Use Euler’s generalization of Fermat’s theorem to find the remainder of 7!°° when divided by 24. 


In Exercises 11 through 18, describe all solutions of the given congruence, as we did in Examples 20.14 and 20.15. 


11. 2x = 6 (mod 4) 12. 22x = 5 (mod 15) 
13. 36x = 15 (mod 24) 14, 45x = 15 (mod 24) 
15. 39x = 125 (mod 9) 16. 41x = 125 (mod 9) 
17. 155x = 75 (mod 65) 18. 39x = 52 (mod 130) 


19, Let p be a prime >3. Use Exercise 28 below to find the remainder of (p — 2)! modulo p. 
20. Using Exercise 28 below, find the remainder of 34! modulo 37. 
21. Using Exercise 28 below, find the remainder of 49! modulo 53. 
22. Using Exercise 28 below, find the remainder of 24! modulo 29. 


Concepts 

23. Mark each of the following true or false. 

a. a’—' = 1 (mod p) for all integers a and primes p. 

b. a?—! = 1 (mod p) for all integers a such that a # 0 (mod p) for a prime p. 


— e. o(n) <n foralln € Z*. 
g(n) <n—A1forallne Zr. 


The units in Z,, are the positive integers less than » and relatively prime to n. 


The product of two nonunits in Z, may be a unit. 
The product of a unit and a nonunit in Z, is never a unit. 


d. 
e. 
f. The product of two units in Z,, is always a unit. 
g. 
h. 
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i. Every congruence ax = b (mod p), where p is a prime, has a solution. 
j. Let d be the ged of positive integers a and m. If d divides b, then the congruence ax = b (mod m) 
has exactly d incongruent solutions. 


24. Give the group multiplication table for the multiplicative group of units in Z;2. To which group of order 4 is it 
isomorphic? 


Proof Synopsis 
25. Give a one-sentence synopsis of the proof of Theorem 20.1. 
26. Give a one-sentence synopsis of the proof of Theorem 20.8. 


Theory 


27, Show that 1 and p — | are the only elements of the field Z, that are their own multiplicative inverse. [Hint: 
Consider the equation x? — 1 = 0.] 


28. Using Exercise 27, deduce the half of Wilson’s theorem that states that if p is a prime, then (p — 1)! =—1 
(mod p). [The other half states that if n is an integer >1 such that (n — 1)! = —1 (mod n), then n is a prime. 
Just think what the remainder of (n — 1)! would be modulo x if n is not a prime.] 


29, Use Fermat’s theorem to show that for any positive integer n, the integer n°’ — 7 is divisible by 383838. [Hint: 
383838 = (37)(19)(13)(7)(3)(2).] 
30. Referring to Exercise 29, find a number larger than 333838 that divides n>’ — n for all positive integers n. 


Tue FIELD OF QUOTIENTS OF AN INTEGRAL DOMAIN 


If an integral domain is such that every nonzero element has a multiplicative inverse, 
then it is a field. However, many integral domains, such as the integers Z, do not form a 
field. This dilemma is not too serious. It is the purpose of this section to show that every 
integral domain can be regarded as being contained in a certain field, a field of quotients 
of the integral domain. This field will be a minimal field containing the integral domain 
in a sense that we shall describe. For example, the integers are contained in the field 
Q, whose elements can all be expressed as quotients of integers. Our construction of a 
field of quotients of an integral domain is exactly the same as the construction of the 
rational numbers from the integers, which often appears in a course in foundations or 
advanced calculus. To follow this construction through is such a good exercise in the use 
of definitions and the concept of isomorphism that we discuss it in some detail, although 
to write out, or to read, every last detail would be tedious. We can be motivated at every 
step by the way Q can be formed from Z. 


The Construction 


Let D be an integral domain that we desire to enlarge to a field of quotients F. A coarse 
outline of the steps we take is as follows: 


1. Define what the elements of F are to be. 
2. Define the binary operations of addition and multiplication on F. 


21.1 Definition 


21.2 Lemma 
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3. Check all the field axioms to show that F is a field under these operations. 
4. Show that F can be viewed as containing D as an integral subdomain. 


Steps 1, 2, and 4 are very interesting, and Step 3 is largely a mechanical chore. We 
proceed with the construction. 


Step 1 Let D be a given integral domain, and form the Cartesian product 
Dx D= {(,b)|a,b € D} 


We are going to think of an ordered pair (a, b) as representing a formal quotient a/b, 
that is, if D = Z, the pair (2, 3) will eventually represent the number 5 for us. The pair 
(2, 0) represents no element of Q and suggests that we cut the set D x D down a bit. 
Let S be the subset of D x D given by 


S={(a,6)|a,b€D,b £0}. 


Now S is still not going to be our field as is indicated by the fact that, with D = Z, 
different pairs of integers such as (2, 3) and (4, 6) can represent the same rational 
number. We next define when two elements of S will eventually represent the same 
element of F, or, as we shall say, when two elements of S are equivalent. 


Two elements (a, b) and (c, d) in S are equivalent, denoted by (a, b) ~ (c, d), if and 
only if ad = be. a 


Observe that this definition is reasonable, since the criterion for (a, b) ~ (c, d) is an 
equation ad = be involving elements in D and concerning the known multiplication in 
D. Note also that for D = Z, the criterion gives us our usual definition of equality of ¢ 
with 5, for example, : = é since (2)(6) = (3)(4). The rational number that we usually 
denote by 5 can be thought of as the collection of all quotients of integers that reduce 
to, or are equivalent to, 2, 


The relation ~ between elements of the set S as just described is an equivalence relation. 


Proof We must check the three properties of an equivalence relation. 


Reflexive (a, b) ~ (a, b) since ab = ba, for multiplication in D is commutative. 


Symmetric If (a, b) ~ (c, d), then ad = bc. Since multiplication in D is commu- 
tative, we deduce that cb = da, and consequently (c, d) ~ (a, b). 


Transitive If (a, b) ~ (c, d) and (c,d) ~ (, s), then ad = be and cs = dr. Using 
these relations and the fact that multiplication in D is commutative, we have 


asd = sad = sbc = bcs = bdr = brd. 


Nowd + 0, and D is an integral domain, so cancellation is valid; this is a crucial step 
in the argument. Hence from asd = brd we obtain as = br, so that (a, b) ~ (7, s). 


¢ 
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We now know, in view of Theorem 0.22, that ~ gives a partition of S into equivalence 
classes. To avoid long bars over extended expressions, we shall let [(@, 0)], rather than 
(a, b), be the equivalence class of (a, b) in S under the relation ~. We now finish Step 1 
by defining F' to be the set of all equivalence classes [(a, b)] for (a, b) € S. 


Step 2 The next lemma serves to define addition and multiplication in F. 
Observe that if D = Z and [(a, b)] is viewed as (a/b) € Q, these definitions applied to 
Q give the usual operations. 

For [(a, b)] and [(c, d)] in F, the equations 
[(a, b)] + [(e, d)] = [(ad + bc, bd)] 


and 


[(a, b)]E(c, €)] = [(ae, bd)] 

give well-defined operations of addition and multiplication on J’. 
Observe first that if [(a, b)] and [(c, d)] are in F, then (a, b) and (c, d) arein S, sob #0 
and d # 0. Because D is an integral domain, bd 4 0, so both (ad + bc, bd) and (ac, bd) 
are in S. (Note the crucial use here of the fact that D has no divisors of 0.) This shows 
that the right-hand sides of the defining equations are at least in F. 

It remains for us to show that these operations of addition and multiplication are 
well defined. That is, they were defined by means of representatives in S of elements of 
F, so we must show that if different representatives in S are chosen, the same element 


of F will result. To this end, suppose that (a), 51) € [(a, b)] and (ci, d1) € [(c, d)]. We 
must show that 


(ayd, + bic, bid,) € [(ad + be, bd)] 
and 
(ajc), bjd;) € [(ae, bd)]. 
Now (a1, 61) € [(a, B)] means that (a), b,) ~ (a, 6); that is, 
ayb = bya. 
Similarly, (c:, d,) € [(c, d)] implies that 
cyd = dc. 


To get a “common denominator” (common second member) for the four pairs (a, b), 
(a1, by), (c, a), and (c;, d,), we multiply the first equation by d)d and the second equation 
by b,b. Adding the resulting equations, we obtain the following equation in D: 


a\bd\d + cjdbyb = byad\d + dich, b. 
Using various axioms for an integral domain, we see that 


(aid, + b1c})bd = byd\(ad + bc), 
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so 
(a,d, + bye1, byd1) ~ (ad + be, bd), 


giving (a,d, + b,c), bid)) € [(ad + be, bd)]. This takes care of addition in F. For mul- 
tiplication in F, on multiplying the equations a,b = bya and c,d = dc, we obtain 


aibc,d = b,ad\c, 
so, using axioms of D, we get 
a Cbd = bi d,ac, 


which implies that 


(ayc1, bid\) ~ (ac, bd). 
Thus (a1¢1, b1d;) € [(ac, bd)], which completes the proof. re 


It is important to understand the meaning of the last lemma and the necessity for 
proving it. This completes our Step 2. 


Step 3 Step 3 is routine, but it is good for us to work through a few of these 
details. The reason for this is that we cannot work through them unless we understand 
what we have done. Thus working through them will contribute to our understanding of 
this construction. We list the things that must be proved and prove a couple of them. 
The rest are left to the exercises. 


1. Addition in F is commutative. 


Proof Now [(a, b)} + [(c, d)] is by definition [(ad + bc, bd)}. Also [(c, d)] + [(a, B)] is by 
definition [((cb + da, db)]. We need to show that (ad + bc, bd) ~ (cb + da, db). This 
is true, since ad + be = cb + da and bd = db, by the axioms of D. 5 

2. Addition is associative. 

[(0, 1)] is an identity element for addition in F. 

[(—a, b)] is an additive inverse for [(a, b)] in F. 

Multiplication in F is associative. 

Multiplication in F is commutative. 

The distributive laws hold in F. 

[(1, 1)] is a multiplicative identity element in F. 


Lf ((a, b)] € F is not the additive identity element, then a ~4 0 in D and 
[(b, a)] is a multiplicative inverse for [(a, b)]. 


a 


Proof Let [(a,5)| € F. Ifa = 0, then 
al =b0=0, 
so 


(a,b) ~ 0, 1). 
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that is, [(a, b)] = [(0, 1)]. But [(0, 1)] is the additive identity by Part 3. Thus if [(a, b)] 
is not the additive identity in F, we have a # 0, so it makes sense to talk about [(b, a)} 
in F. Now [(a, b)|[(b, a)] = [(ab, ba)]. But in D we have ab = ba, so (ab)1 = (ba)l, 
and 


(ab, ba) ~ C1, 1). 
Thus 
[(a, bY, a)] = (0, DI, 
and [(1, 1)] is the multiplicative identity by Part 8. e 
This completes Step 3. 


Step 4 It remains for us to show that F can be regarded as containing D. To do 
this, we show that there is an isomorphism i of D with a subdomain of F. Then if we 
rename the image of D under i using the names of the elements of D, we will be done. 
The next lemma gives us this isomorphism. We use the letter i for this isomorphism to 
suggest injection (see the footnote on page 4); we will inject D into F. 


The map i: D > F given by i(a) = [(, 1)] is an isomorphism of D with a subring 
of F. 


For a and b in D, we have 
ifa+ b)=[a@+8, 1)). 
Also, 
i(a) + i(b) = [(a, D}+(@. 1] = (al + 14, D) = [a +, DI. 

so i(a + b) = i(a) + i(b). Furthermore, 

i(ab) = [(ab, DI, 
while 

i(ayi(b) = [(a, DIL, 1D] = [(a, DI, 
so i(ab) = i(a)i(b). 
It remains for us to show only that i is one to one. If i(a) = (6), then 

ia, Db] = [@, 1], 

so (a, 1) ~ (b, 1) giving a1 = 18; thatis, 
a=b. 

Thus i is an isomorphism of D with i[D], and, of course, i [D] is then a subdomain 
of F. ¢ 


Since [(a. b)] = [(a, DIA. dy] = La, DI/[@. 1] = 1(@)/i(6) clearly holds in F, 
we have now proved the following theorem. 


Any integral domain D can be enlarged to (or embedded in) a field F such that every 
element of F can be expressed as a quotient of two elements of D. (Such a field F is a 
field of quotients of D.) 


21.6 Theorem 


Proof 
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Uniqueness 


We said in the beginning that F could be regarded in some sense as a minimal field 
containing D. This is intuitively evident, since every field containing D must contain 
all elements a/b for every a, b € D with b $ 0. The next theorem will show that every 
field containing D contains a subfield which is a field of quotients of D, and that any 
two fields of quotients of D are isomorphic. 


Let F be a field of quotients of D and let L be any field containing D. Then there exists a 
map y : F — L that gives an isomorphism of F with a subfield of L such that (a) = a 
fora eé D. 


The subring and mapping diagram in Fig. 21.7 may help you to visualize the situation 
for this theorem. 

An element of F is of the form a /r b where /» denotes the quotient of a ¢ D by 
b € D regarded as elements of F. We of course want to map a /p b onto a /, b where 
/, denotes the quotient of elements in L. The main job will be to show that such a map 
is well defined. 

We must define y : F — L, and we start by defining 

wa)=a for aéD. 


Every x € F is a quotient a /r b of some two elements a and b, b 4 0, of D. Let us 
attempt to define y by 
Wa fe by =W@) /, VO). 
We must first show that this map is sensible and well-defined. Since yr is the identity 
on D, for b 4 0 we have w(b) $ 0, so our definition of W(a /r b) as w(a) /, w(b) makes 
sense. If a /r b=c /r d in F, then ad = bc in D, so (ad) = (bc). But since w is 
the identity on D, 
Yiad)=W(a)v@) and wbc)= (bye). 
Thus 
WOAhtvo=vohw@ 

in L, so & is well-defined. 

The equations 


Vay) = vv) 


L 
ee ¥ ___y WIFI 
“ oe 
D 


21.7 Figure 
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and 


vaty)=V@)+ vO) 


follow easily from the definition of w on F and from the fact that y is the identity on D. 
If w(a /r b) = Wc /p d), we have 


V@ 1 vO)=VvO LvV@ 


so 


YOV@ = VO). 


Since w is the identity on D, we then deduce that ad = bc, soa /p b =c /p d. Thus yw 
is one to one. 
By definition, w(a) = a fora € D. 4 


21.8 Corollary Every field Z containing an integral domain D contains a field of quotients of D. 


Proof Inthe proof of Theorem 21.6 every element of the subfield w[F] of Z is a quotient in L 
of elements of D. ¢ 


21.9 Corollary Any two fields of quotients of an integral domain D are isomorphic. 


Proof Suppose in Theorem 21.6 that L is a field of quotients of D, so that every element x of L 
can be expressed in the form a /, b for a, b € D. Then L is the field w[F] of the proof 
of Theorem 21.6 and is thus isomorphic to F’. ¢e 
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Computations 
1. Describe the field F of quotients of the integral subdomain 
D={n+miln,m € Z} 


of C. “Describe” means give the elements of C that make up the field of quotients of D in C. (The elements of 
D are the Gaussian integers.) 


2. Describe (in the sense of Exercise 1) the field F of quotients of the integral subdomain D = {n + mJ/2|n,meZ} 
of R. 


Concepts 


3. Correct the definition of the italicized term without reference to the text, if correction is needed, so that it is in 
a form acceptable for publication. 


A field of quotients of an integral domain D is a field F in which D can be embedded so that every nonzero 
element of D is a unit in F. 
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. Mark each of the following true or false. 


a. Qis a field of quotients of Z. 
b. Ris a field of quotients of Z. 
ce. Ris a field of quotients of R. 
d. Cis a field of quotients of R. 
e. If D is a field, then any field of quotients of D is isomorphic to D. 
f. The fact that D has no divisors of 0 was used strongly several times in the construction of a field 
F of quotients of the integral domain D. 
g. Every element of an integral domain D is a unit in a field F of quotients of D. 
______ h. Every nonzero element of an integral domain D is a unit in a field F of quotients of D. 
i. A field of quotients F’ of a subdomain D’ of an integral domain D can be regarded as a subfield 
of some field of quotients of D. 
j. Every field of quotients of Z is isomorphic to Q. 


. Show by an example that a field F’ of quotients of a proper subdomain D’ of an integral domain D may also 


be a field of quotients for D. 


Theory 


6. 
Te 
8. 
9. 
10. 
11. 
12. 


13. 


14. 
15. 


16. 


17, 


Prove Part 2 of Step 3. You may assume any preceding part of Step 3. 
Prove Part 3 of Step 3. You may assume any preceding part of Step 3. 
Prove Part 4 of Step 3. You may assume any preceding part of Step 3. 
Prove Part 5 of Step 3. You may assume any preceding part of Step 3. 
Prove Part 6 of Step 3. You may assume any preceding part of Step 3. 
Prove Part 7 of Step 3. You may assume any preceding part of Step 3. 


Let R be a nonzero commutative ring, and let T be a nonempty subset of R closed under multiplication and 
containing neither 0 nor divisors of 0. Starting with R x T and otherwise exactly following the construction in 
this section, we can show that the ring R can be enlarged to a partial ring of quotients Q(R, T). Think about 
this for 15 minutes or so; look back over the construction and see why things still work. In particular, show the 
following: 


a. Q(R, T) has unity even if R does not. 
b. In Q(R, T), every nonzero element of T is a unit. 


Prove from Exercise 12 that every nonzero commutative ring containing an element a that is not a divisor of 0 
can be enlarged to a commutative ring with unity. Compare with Exercise 30 of Section 19. 


With reference to Exercise 12, how many elements are there in the ring Q(Zu, {1, 3})? 


With reference to Exercise 12, describe the ring Q(Z, {2"|n € Z*}), by describing a subring of R to which it 
is isomorphic. 

With reference to Exercise 12, describe the ring O(3Z, {6" |n € Z*}) by describing a subring of R to which it 
is isomorphic. 

With reference to Exercise 12, suppose we drop the condition that T have no divisors of zero and just require 
that nonempty 7 not containing 0 be closed under multiplication. The attempt to enlarge R to a commutative 
ring with unity in which every nonzero element of T is a unit must fail if T contains an element a that is a 
divisor of 0, for a divisor of 0 cannot also be a unit. Try to discover where a construction parallel to that in the 
text but starting with R x T first runs into trouble. In particular, for R = Zp and T = {1, 2, 4}, illustrate the 
first difficulty encountered. [Hint: It is in Step 1.] 
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RINGS OF POLYNOMIALS 
Polynomials in an Indeterminate 


We all have a pretty workable idea of what constitutes a polynomial in x with coefficients 
in a ring R. We can guess how to add and multiply such polynomials and know what is 
meant by the degree of a polynomial. We expect that the set R[] of all polynomials with 
coefficients in the ring R is itself a ring with the usual operations of polynomial addition 
and multiplication, and that R is a subring of R[x]. However, we will be working with 
polynomials from a slightly different viewpoint than the approach in high school algebra 
or calculus, and there are a few things that we want to say. 

In the first place, we will call x an indeterminate rather than a variable. Suppose, for 
example that our ring of coefficients is Z. One of the polynomials in the ring Z[x] is 1x, 
which we shall write simply as x. Now x is not 1 or 2 or any of the other elements of Z[x]}. 
Thus from now on we will never write such things as “x = 1” or “x = 2,” as we have 
done in other courses. We call x an indeterminate rather than a variable to emphasize this 
change. Also, we will never write an expression such as “x? _ 4 = 0,” simply because 
x? — 4 is not the zero polynomial in our ring Z[x]. We are accustomed to speaking of 
“solving a polynomial equation,” and will be spending a lot of time in the remainder of 
our text discussing this, but we will always refer to it as “finding a zero ofa polynomial.” 
In summary, we try to be careful in our discussion of algebraic structures not to say in 


one context that things are equal and in another context that they are not equal. 


we HISTORICAL NOTE 


Ihe use of x and other letters near the end of 

the alphabet to represent an “indeterminate” 
is due to René Descartes (1596-1650). Earlier, 
Francois Viete (1540-1603) had used vowels for in- 
determinates and consonants for known quantities. 
Descartes is also responsible for the first publication 
of the factor theorem (Corollary 23.3) in his work 
The Geometry, which appeared as an appendix to 
his Discourse on Method (1637). This work also 
contained the first publication of the basic concepts 
of analytic geometry; Descartes showed how geo- 
metric curves can be described algebraically. 

Descartes was born to a wealthy family in 
La Haye, France; since he was always of delicate 
health, he formed the habit of spending his mornings 
in bed. It was at these times that he accomplished 
his most productive work. The Discourse on Method 
was Descartes’ attempt to show the proper proce- 
dures for “searching for truth in the sciences.” The 
first step in this process was to reject as absolutely 


false everything of which he had the least doubt; but, 
since it was necessary that he who was thinking was 
“something,” he conceived his first principle of phi- 
losophy: “I think, therefore I am.” The most enlight- 
ening parts of the Discourse on Method, however, 
are the three appendices: The Optics, The Geometry, 
and The Meteorology. It was here that Descartes 
provided examples of how he actually applied his 
method. Among the important ideas Descartes dis- 
covered and published in these works were the sine 
law of refraction of light, the basics of the theory 
of equations, and a geometric explanation of the 
rainbow. 

In 1649, Descartes was invited by Queen 
Christina of Sweden to come to Stockholm to tutor 
her. Unfortunately, the Queen required him, cont- 
rary to his long-established habits, to rise at an carly 
hour. He soon contracted a lung disease and died in 
1650. 


22.1 Definition 


Section 22 Rings of Polynomials 199 


If a person knows nothing about polynomials, it is not an easy task to describe 
precisely the nature of a polynomial in x with coefficients in a ring R. If we just define 
such a polynomial to be a finite formal sum 


an 
So ajx' = ayo + ayx +--+ +ayx", 
i=0 
where a; € R, we get ourselves into a bit of trouble. For surely 0 + ayx andO+ a,x + 
Ox? are different as formal sums, but we want to regard them as the same polynomial. A 
practical solution to this problem is to define a polynomial as an infinite formal sum 


[oe] 
Di aix! = ay tax tee tage" boo, 
i=0 
where a; = 0 for all but a finite number of values of i. Now there is no problem of having 
more than one formal sum represent what we wish to consider a single polynomial. 


Let R be aring. A polynomial f(x) with coefficients in R is an infinite formal sum 


wo 
yo ae =aytayxt---+ayx™+---, 
i=0 
where a; € R anda; = 0 for all but a finite number of values of i. The a; are coefficients 
of f(x). If for some i > 0 it is true that a; + 0, the largest such value of i is the degree 
of f(x). If all a; = 0, then the degree of f(x) is undefined.' | 


To simplify working with polynomials, let us agree that if f(x) = ap + ayx +---+ 
a,x" +--- has a; = 0 fori > n, then we may denote f(x) by agp + ayx +--+ + ayx". 
Also, if R has unity 1 4 0, we will write a term 1x* in such a sum as x*. For example, 
in Z[x], we will write the polynomial 2 + 1x as 2+ x. Finally, we shall agree that we 
may omit altogether from the formal sum any term Ox', or ap if ag = O but notalla; = 0. 
Thus 0, 2, x, and 2+ x? are polynomials with coefficients in Z. An element of R is a 
constant polynomial. 

Addition and multiplication of polynomials with coefficients in a ring R are defined 
in a way familiar to us. If 


SQ) = ao tayx tess tayx" +--+ 
and 

B(x) = bot dix t- Fb x +.--, 
then for polynomial addition, we have 


f@) + g(x) = Co eX tere t Cyx" + +++ where Cy = ay + bn, 


7 The degree of the zero polynomial is sometimes defined to be —1, which is the first integer less than 0, or 
defined to be —oo so that the degree of f(x)g(x) will be the sum of the degrees of f(x) and g(x) if one of 
them is zero. 
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22.2 Theorem 


Proof 


Rings and Fields 


and for polynomial multiplication, we have 
flx)g(x) = do + dix +--+. +dyx" +++» where dy = YO" aibn-i 


Observe that both c; and d; are 0 for all but a finite number of values of i, so these 
definitions make sense. Note that )“/_, ajb,—; need not equal }°7_9 bjan—; if R is not 
commutative. With these definitions of addition and multiplication, we have the following 
theorem. 


The set R[x] of all polynomials in an indeterminate x with coefficients in a ring R is a 
ring under polynomial addition and multiplication. If R is commutative, then so is R[x], 
and if R has unity 1 4 0, then 1 is also unity for R[x]. 


That (R[x], +) is an abelian group is apparent. The associative law for multiplication 
and the distributive laws are straightforward, but slightly cumbersome, computations. 
We illustrate by proving the associative law. 

Applying ring axioms to a;, b;, c, € R, we obtain 


(=e ll el 7 


I 
iMs 


z E(B) 
-(E«)(E\(E~)] 


Whew!! In this computation, the fourth expression, having just two summation signs, 
should be viewed as the value of the triple product f(x) g(x)A(x) of these polynomials 
under this associative multiplication. (In a similar fashion, we view f(g(h(x))) as the 
value of the associative composition (f o g o h)(x) of three functions f, g, and h.) 

The distributive laws are similarly proved. (See Exercise 26.) 

The comments prior to the statement of the theorem show that R[x]isa commutative 
ring if R is commutative, and a unity 1 # 0 in R is also unity for R[x], in view of the 
definition of multiplication in R[x]. ¢ 


Il 
aw 
Me 
& 

+ 


Thus Z[x] is the ring of polynomials in the indeterminate x with integral coefficients, 
O[x] the ring of polynomials in x with rational coefficients, and so on. 


22.3 Example 


22.4 Theorem 


Proof 
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In Z[x], we have 
+12 =@4De4+¢D=2x2 404 Det lax? 
Still working in Z,[x], we obtain 
(w+)D4@4D)=04)D*4+04+1)=0770=0. A 


If Ris aring and x and y are two indeterminates, then we can form the ring (R[x ])[y], 
that is, the ring of polynomials in y with coefficients that are polynomials in x. Every 
polynomial in y with coefficients that are polynomials in x can be rewritten in a natu- 
ral way as a polynomial in x with coefficients that are polynomials in y as illustrated 
by Exercise 20. This indicates that (R[x])[y] is naturally isomorphic to (R[y]){x], al- 
though a careful proof is tedious. We shall identify these rings by means of this natural 
isomorphism, and shall consider this ring R[x, y] the ring of polynomials in two inde- 
terminates x and y with coefficients in R. The ring R[x;, ---, x,] of polynomials in 
the n indeterminates x; with coefficients in R is similarly defined. 

We leave as Exercise 24 the proof that if D is an integral domain then so is D[x]. In 
particular, if F is a field, then F'[x] is an integral domain. Note that Fx] is nota field, for 
x is not a unit in F[x]. That is, there is no polynomial f(x) € F[x] such that xf(x) = 1. 
By Theorem 21.5, one can construct the field of quotients F(x) of F[x]. Any element 
in F(x) can be represented as a quotient f(x)/g(x) of two polynomials in F[x] with 
a(x) # 0. We similarly define F(x), ---, x») to be the field of quotients of F'[x1, ---, xn]. 
This field F(x1, +++, X,) is the field of rational functions in n indeterminates over F. 
These fields play a very important role in algebraic geometry. 


The Evaluation Homomorphisms 


We are now ready to proceed to show how homomorphisms can be used to study what we 
have always referred to as “solving a polynomial equation.” Let E and F be fields, with F 
a subfield of E, that is, F < E. The next theorem asserts the existence of very important 
homomorphisms of F[x] into E. These homomorphisms will be the fundamental tools 
for much of the rest of our work. 


(The Evaluation Homomorphisms for Field Theory) Let F be a subfield of a field 
E, let a be any element of £, and let x be an indeterminate. The map ¢, : F[x] ~ E 
defined by 


aldo + a1x + +++ + a_x") = ay + aya +++ + aq” 
for (ag + ayx +---+a,x") € F[x]is ahomomorphism of F[x] into E. Also, d¢(%) = 


a, and @y maps F isomorphically by the identity map; that is, ¢.(@) = a fora ¢ F. The 
homomorphism ¢, is evaluation at a. 


The subfield and mapping diagram in Fig. 22.5 may help us to visualize this situation. 
The dashed lines indicate an element of the set. The theorem is really an immediate 
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E 
Pe 
Fx] ———_—_—_> bol FLT 
*S x ~s@ = b,(%) 
Identity map 
“ as PL. 

“a ~a= (4) 

22.5 Figure 


consequence of our definitions of addition and multiplication in F[x]. The map $y is 
well defined, that is, independent of our representation of f(x) € F[x] as a finite sum 


Ag + A,X +++ + yx”, 
since such a finite sum representing f(x) can be changed only by insertion or deletion 
of terms Ox', which does not affect the value of de (f(x)). 
If f(x) = ap taux tes: tayx", g(x) = bo + bix +++ + bmx”, and h(x) = 
f(x) + gx) = co Feyx +++» +x", then 
bal f (x) + g(x) = bal(h(x)) = co tere +--+ + c,e", 
while 
bal f(x) + balgx)) = (ao + are + +++ + ane”) + (Bo + bie + +++ + dm”). 
Since by definition of polynomial addition we have c; = a; + b;, we see that 
bal f(x) + (x) = bal f)) + Gals). 
Turning to multiplication, we see that if 
F(xdg(x) = do + dix +--+ +dsx°, 
then 
bal f(x)g(x)) = do + dia +--+ dsc", 
while 


[bal fe) be(g(e))] = (ao + ara + +++ + a0") (bo + dia + +++ + Bm”). 


22.6 Example 


22.7 Example 


22.8 Example 
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Since by definition of polynomial multiplication dj = ae a;bj_;, we see that 
bal f(~)g@)) = [bal fF) balg))].- 


Thus ¢, is a homomorphism. 

The very definition of ¢, applied to a constant polynomial a € F[x], where a € F. 
gives 6,(a) = a, 80 @, maps F isomorphically by the identity map. Again by definition 
of dy, we have @y(x) = dy(1x) = la =a. 5 


We point out that this theorem is valid with the identical proof if F and E are 
merely commutative rings with unity rather than fields. However, we shall be interested 
primarily in the case in which they are fields. 

Itis hard to overemphasize the importance of this simple theorem for us. Itis the very 
foundation for all of our further work in field theory. It is so simple that it could justifiably 
be called an observation rather than a theorem. It was perhaps a little misleading to write 
out the proof because the polynomial notation makes it look so complicated that you 
may be fooled into thinking it is a difficult theorem. 


Let F be Q and E be R in Theorem 22.4, and consider the evaluation homomorphism 
oo : Q[x] — R. Here 


o(ao + aix + +++ + a_x") = ay +a,0+ +--+ a, 0" = ap. 
Thus every polynomial is mapped onto its constant term. A 


Let E be Q and E be R in Theorem 22.4 and consider the evaluation homomorphism 
2 : Q[x] — R. Here 


$2 (ao + ax + -+++,X") = dy + ,2 +--+ + Gy2", 
Note that 
do(x? +x —6) =2?4+2-6=0. 
Thus x? + x — 6 is in the kernel N of ¢2. Of course, 
+x —-6 = (x — 2x + 3), 
and the reason that $)(x? + x — 6) = 0 is that d(x — 2) = 2-2=0. A 


Let F be Q and E be C in Theorem 22.4 and consider the evaluation homomorphism 
; : QLx] > C. Here 


@i(ao + ax + +--+ a_x") = a9 + agi +--+ ani” 
and @;(x) = i. Note that 
MX? +)=P+1=0, 


so x? + Lis in the kernel N of ¢;. A 
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Let F be Q and let E be R in Theorem 22.4 and consider the evaluation homomorphism 
ox : Q[x] — R. Here 


bn (ao + 1x + +++ + nx") = ay bam +o + ant". 


Itcan be proved that ag + aj tees a," = Oifandonlyifa; = Ofori = 0,1,---,n. 
Thus the kernel of ¢, is {0}, and ¢, is a one-to-one map. This shows that all formal 
polynomials in x with rational coefficients form a ring isomorphic to Q[x] in a natural 
way with @,(x) =z. A 


The New Approach 


We now complete the connection between our new ideas and the classical concept of 
solving a polynomial equation. Rather than speak of solving a polynomial equation, we 
shall refer to finding a zero of a polynomial. 


Let F be a subfield of a field E, and let a be an element of E. Let f(x) =a) + 
ax +++ +a,x" be in F[x], and let bq | F[x] > E be the evaluation homomorphism 
of Theorem 22.4. Let f(a) denote 


da ( f(x) = dp t+ ayatere + ana”. 
If f(a) = 0, then a is a zero of FC). | 


In terms of this definition, we can rephrase the classical problem of finding all real 


_ numbers r such that r? +r — 6 = 0 by letting F = Q and E = R and finding alla ¢ R 


such that 
da(x? +x — 6) = 0, 
that is, finding all zeros of x? +x — 6 in R. Both problems have the same answer, since 
{a € R| de(x? +x —6) =0} ={r E Riv? +r —6 = 0} = {2, -3}. 


It may seem that we have merely succeeded in making a simple problem seem quite 
complicated. In fact, what we have done is to phrase the problem in the language of 
mappings, and we can now use all the mapping machinery that we have developed and 
will continue to develop for its solution. 


Our Basic Goal 


We continue to attempt to put our future work in perspective. Sections 26 and 27 are 
concerned with topics in ring theory that are analogous to the material on factor groups 
and homomorphisms for group theory. However, our aim in developing these analogous 
concepts for rings will be quite different from our aims in group theory. In group the- 
ory we used the concepts of factor groups and homomorphisms to study the structure 
of a given group and to determine the types of group structures of certain orders that 
could exist. We will be talking about homomorphisms and factor rings in Section 26 


22.11 Theorem 


Proof 
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with an eye to finding zeros of polynomials, which is one of the oldest and most funda- 
mental problems in algebra. Let us take a moment to talk about this aim in the light of 
mathematical history, using the language of “solving polynomial equations” to which 
we are accustomed. 

We start with the Pythagorean school of mathematics of about 525 B.c. The 
Pythagoreans worked with the assumption that all distances are commensurable: that 
is, given distances a and b, there should exist a unit of distance u and integers n and mm 
such that a = (n)(u) and b = (m)(u). In terms of numbers, then, thinking of wv as beings 
one unit of distance, they maintained that all numbers are integers. This idea of com- 
mensurability can be rephrased according to our ideas as an assertion that all numbers 
are rational, for if a and b are rational numbers, then each is an integral multiple of the 
reciprocal of the least common multiple of their denominators. For example, if a = + 
and b = 12, then a = (35)(%) and b = (76)(%). 

The Pythagoreans knew, of course, what is now called the Pythagorean theorem; 
that is, for a right triangle with legs of lengths a and b and a hypotenuse of length c, 


et+h=c’. 


They also had to grant the existence of a hypotenuse of a right triangle having two 
legs of equal length, say one unit each. The hypotenuse of such a right triangle would, 
as we know, have to have a length of /2. Imagine then their consternation and dis- 
may when one of their society—according to some stories it was Pythagoras himself— 
came up with the embarrassing fact that is stated in our terminology in the following 
theorem. 


The polynomial x” — 2 has no zeros in the rational numbers. Thus J2 is not a rational 
number. 


Suppose that m/n form, € Z is a rational number such that (m/n)? = 2. We assume 
that we have canceled any factors common to mm and n, so that the fraction m/n is in 
lowest terms with gcd(m, m) = 1. Then 


where both m? and 2n? are integers. Since m? and 2n? are the same integer, and since 
2 is a factor of 2n?, we see that 2 must be one of the factors of m*. But as a square. 
m? has as factors the factors of m repeated twice. Thus mm must have two factors 2. Then 
2n* must have two factors 2, so n” must have 2 as a factor, and consequently has 2 
as a factor. We have deduced from m? = 2n? that both m and n must be divisible by 2, 
contradicting the fact that the fraction m/n is in lowest terms. Thus we have 2 4 (m/n)- 
for any m,n € Z. 


Thus the Pythagoreans ran right into the question of a solution of a polynomial equa- 
tion, x” — 2 = 0. We refer the student to Shanks [36, Chapter 3], for a lively and totally 
delightful account of this Pythagorean dilemma and its significance in mathematics. 
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mw HisToricaL NOTE 


he solution of polynomial equations has been a 

goal of mathematics for nearly 4000 years. The 
Babylonians developed versions of the quadratic 
formula to solve quadratic equations. For example, 
to solve x? — x = 870, the Babylonian scribe in- 
structed his students to take half of 1 (4), square it 
(;), and add that to 870. The square root of 870}, 
namely 294, is then added to $ to give 30 as the an- 
swer. What the scribes did not discuss, however, was 
what to do if the square root in this process was not a 
rational number. Chinese mathematicians, however, 
from about 200 B.C., discovered a method similar 
to what is now called Horner's method to solve 


quadratic equations numerically, since they used 


carry out the computation to as many places as 
necessary and could therefore ignore the distinc- 
tion between rational and irrational solutions. The 
Chinese, in fact, extended their numerical tech- 
niques to polynomial equations of higher degree. 
In the Arab world, the Persian poet-mathematician 
Omar Khayyam (1048-1131) developed methods 
for solving cubic equations geometrically by find- 
ing the point(s) of intersection of appropriately cho- 
sen conic sections, while Sharaf al-Din al-Tusi (died 
1213) used, in effect, techniques of calculus to de- 
termine whether or not a cubic equation had a real 
positive root. It was the Italian Girolamo Cardano 
(1501-1576) who first published a procedure for 


solving cubic equations algebraically. 


a decimal system, they were able in principle to 


L 


In our motivation of the definition of a group, we commented on the necessity of 
having negative numbers, so that equations such as x + 2 =0 might have solutions. 
The introduction of negative numbers caused a certain amount of consternation in some 
philosophical circles. We can visualize 1 apple, 2 apples, and even rT apples, but how can 

* we point to anything and say that it is —17 apples? Finally, consideration of the equation 
x? + 1 = 0 led to the introduction of the number 7. The very name of an “imaginary 
number” given to 7 shows how this number was regarded. Even today, many students 
are led by this name to regard 7 with some degree of suspicion. The negative numbers 
were introduced to us at such an early stage in our mathematical development that we 
accepted them without question. 

We first met polynomials in high school freshman algebra. The first problem there 
was to learn how to add, multiply, and factor polynomials. Then, in both freshman algebra 
and in the second course in algebra in high school, considerable emphasis was placed 
on solving polynomial equations. These topics are exactly those with which we shall 
be concerned. The difference is that while in high school, only polynomials with real 
number coefficients were considered, we shall be doing our work for polynomials with 
coefficients from any field. 

Once we have developed the machinery of homomorphisms and factor rings in 
Section 26, we will proceed with our basic goal: to show that given any polynomial of 
degree > 1, where the coefficients of the polynomial may be from any field, we can find 
a zero of this polynomial in some field containing the given field. After the machinery 
is developed in Sections 26 and 27, the achievement of this goal will be very easy, and 
is really a very elegant piece of mathematics. 

All this fuss may seem ridiculous, but just think back in history. This is the culmi- 
nation of more than 2000 years of mathematical endeavor in working with polynomial 
equations. After achieving our basic goal, we shall spend the rest of our time studying the 
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nature of these solutions of polynomial equations. We need have no fear in approaching 
this material. We shall be dealing with familiar topics of high school algebra. This work 
should seem much more natural than group theory. 

In conclusion, we remark that the machinery of factor rings and ring homomorphisms 
is notreally necessary in order for us to achieve our basic goal. For a direct demonstration. 
see Artin [27, p. 29]. However, factor rings and ring homomorphisms are fundamental 
ideas that we should grasp, and our basic goal will follow very easily once we have 
mastered them. 


@ EXERCISES 22 


Computations 
In Exercises 1 through 4, find the sum and the product of the given polynomials in the given polynomial ring. 
1. f(x) = 4x —5, g(x) = 2x? — 4x 42 in Ze [x]. 
2. f@a=x+le@®=x4+1inZ,[x]. 
3. f(x) = 2x7 43x44, o(x) = 3x? 4+ 2x +3 in Ze[x]. 
4. f(x) = 2x3 44x? + 3x +2, o(x) = 3x7 42x +4 in Zs[z]. 
5. How many polynomials are there of degree < 3 in Z2[x]? (Include 0.) 
6 


. How many polynomials are there of degree < 2 in Zs[x]? (Include 0.) 
In Exercises 7 and 8, F = # = C in Theorem 22.4. Compute for the indicated evaluation homomorphism. 
7. d(x? +3) 8. (2x3 — x? 4+ 3x 4 2) 
In Exercises 9 through 11, F = E = Z, in Theorem 22.4. Compute for the indicated evaluation homomorphism. 


9. bs{(x4t + 2x)(x7 — 3x? + 3)] 10. f5{(x3 + 2)(4x2 + 3)? + 3x7 + 1] 
TL. b4(Bx 6 + 5x99 + 2x93) [Hint: Use Fermat’s theorem. ] 


In Exercises 12 through 15, find all zeros in the indicated finite field of the given polynomial with coefficients in 
that field. (Hint: One way is simply to try all candidates!] 


12. x7 +1linZ 13. x° +2x+2inZ 
14. x5 43x34 x2 4+2x in Zs 
15. f(x)g(x) where f(x) = x? + 2x? +5 and g(x) = 3x? +2x in Z, 


16. Let ¢g : Zs[x] —> Zs be an evaluation homomorphism as in Theorem 22.4. Use Fermat’s theorem to evaluate 
o3(x?3! 4 3xh7 = 9x33 oie 1). 


17. Use Fermat’s theorem to find all zeros in Zs of 2x?!9 + 3x74 +2457 + 3x, 


Concepts 


In Exercises 18 and 19, correct the definition of the italicized term without reference to the text, if correction is 
needed, so that it is in a form acceptable for publication. 
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18. 


19 


. 


20. 


21. 


22. 
23. 


PartIV Rings and Fields 


A polynomial with coefficients in a ring R is an infinite formal sum 


CO 
ax = dy + ayx tanx? +--+ ayx” 40+ 
i=0 

where a; € R fori = 0,1,2,---. 


Let F be afield and let f(x) € F[x]. A zero of f(x)isana € F such that ¢,(f(«)) = 0, where dy : F(x) > F 
is the evaluation homomorphism mapping «x into @. 


Consider the element 
fx, y) = Bx? + 2x)y? + (x? — Ox + Dy? + (x4 — 2x)y + (x4 — 3x? +2) 


of (Q[x])[y1. Write f(x, y) as it would appear if viewed as an element of (CORESE 


Consider the evaluation homomorphism @¢5 : Q[x] + R. Find six elements in the kernel of the homomor- 
phism ¢s5. 


Find a polynomial of degree >0 in Z4[x] that is a unit. 
Mark each of the following true or false. 


a. The polynomial (a,x" +--+: + a,x +9) € R[x] is 0 if and only ifa; = 0, fori =0,1,---,n. 

b. If R is a commutative ring, then R[x] is commutative. 

c. If D is an integral domain, then D[x] is an integral domain. 

d. If R is aring containing divisors of 0, then R[x] has divisors of 0. 

e. If R is aring and f(x) and g(x) in R[x] are of degrees 3 and 4, respectively, then f(x)g(x) may 
be of degree 8 in R[x]. 

____ f. If R'is any ring and f(x) and g(x) in R[x] are of degrees 3 and 4, respectively, then f(x)g(x) is 
always of degree 7. 

g. If F is a subfield E anda € E is a zero of f(x) € F[x], then @ is a zero of A(x) = F(@)g(x) for 
all g(x) € F[x}. 

h. If F is a field, then the units in F[x] are precisely the units in F. 

i. If R is aring, then x is never a divisor of 0 in R[x]. 

j. If R is aring, then the zero divisors in R[x] are precisely the zero divisors in R. 


Theory 


24. 
25. 


26. 
27. 


Prove that if D is an integral domain, then D[x] is an integral domain. 


Let D be an integral domain and x an indeterminate. 


a. Describe the units in D[x]. 
b. Find the units in Z[x]. 
c. Find the units in Z[x]. 


Prove the left distributive law for R[x], where R is a ring and x is an indeterminate. 


Let F be a field of characteristic zero and let D be the formal polynomial differentiation map, so that 
D(a + ax + anx* +++) + a,x") = ay +2: aox te tn ayx”™!. 
a. Show that D : F[x] —~ F[x]is a group homomorphism of (F[x], +) into itself. Is D aring homomorphism? 


28. 


29. 


30. 


31. 
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b. Find the kernel of D. 
c. Find the image of F[x] under D. 
Let F be a subfield of a field £. 
a, Define an evaluation homomorphism 
Dory 1 FX, +++, Xn] > E for a; € E, 
stating the analog of Theorem 22.4. 
b. With E = F = Q, compute $_32(x17x2° + 3x1 4x9). 


c. Define the concept of a zero of a polynomial f(x1,-+++, Xn) € Flx1,+++,%,] in a way analogous to the 
definition in the text of a zero of f(x). 


Let R be aring, and let R® be the set of all functions mapping R into R. For ¢, # € R* , define the sum $ + wv 
by 
@+Wr) =o) +40) 
and the product @ : w by 
($$: WO) =eoMve) 
forr € R. Note that - is not function composition. Show that (R*, +, -) is a ring. 
Referring to Exercise 29, let F be a field. An element ¢ of F F is a polynomial function on F, if there exists 
F(x) € F[x] such that d(@@) = f(@ for alla € F. 
a. Show that the set Pr of all polynomial functions on F forms a subring of F’. 
b. Show that the ring Pr is not necessarily isomorphic to F[x]. [Hint: Show that if F is a finite field, Pr and 
F [x] don’t even have the same number of elements. ] 
Refer to Exercises 29 aiid 30 for the following questions. 
a. How many elements are there in Z2”?? in Z37?? 
b. Classify (Z2”*, +) and (Z3”, +) by Theorem 11.12, the Fundamental Theorem of finitely generated abelian 
groups. 
c. Show thatif F isa finite field, then F* = Pr. [Hint: Ofcourse, Pr © F*.Let F have aselements a), +++, dy. 
Note that if . 
file) = c= a1) + = aj) — G41) +++ & — an); 


then f;(a;) = 0 for j #i, and the value f;(a;) can be controlled by the choice of c € F. Use this to show 
that every function on F is a polynomial function.] 


FACTORIZATION OF POLYNOMIALS OVER A FIELD 


Recall that we are concerned with finding zeros of polynomials. Let E and F be fields, 
with F < E. Suppose that f(x) € F[x] factors in F[x], so that f(x) = g(x)A(x) for 
g(x), h(x) € F[x] and leta € E. Now for the evaluation homomorphism ¢,, we have 


F(@) = bal FX) = bal Z@)Ah@)) = al S*))balh(%)) = gah(a). 


Thus ifa € E, then f(@) = Oif and only if either g(@) = 0 or h(a) = 0. The attempt to 
find a zero of f(x) is reduced to the problem of finding a zero of a factor of f(x). This 
is one reason why it is useful to study factorization of polynomials. 
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Part IV 


23.1 Theorem 


Proof 


Rings and Fields 


The Division Algorithm in F[x] 


The following theorem is the basic tool for our work in this section. Note the similarity 
with the division algorithm for Z given in Theorem 6.3, the importance of which has 
been amply demonstrated. 


(Division Algorithm for F[x]) Let 
ff) = ax" + Gy x") +.--+ag 


and 


B(x) = by x™ + Dmx" | +++ + do 


be two elements of F[x], with a, and b» both nonzero elements of F and m > 0. Then 
there are unique polynomials g(x) and r(x) in F'[x] such that f(~) = g(x)q(x) + r(x), 
where either r(x) = 0 or the degree of r(x) is less than the degree m of g(x). 


Consider the set S = { f(x) — g(x)s(x) | s(x) € F[x]}. If0 € S then there exists an s(x) 
such that f(x) — g(x)s(x) = 0, so f(x) = g(x)s(x). Taking qx) = s(*) and r(x) = 0, 
we are done. Otherwise, let r(x) be an element of minimal degree in S. Then 
f(x) = gq) +7) 
for some q(x) € F[x]. We must show that the degree of r(x) is less than m. Suppose that 
r(x) = crx! +ep_yxt +---+e0, 

with c; € F ande; £0. If t > m, then 

f(x) = q(x) g(x) — (er/Bm)x'" B(x) = re) — (er/bm)x' 8), (1) 


; and the latter is of the form 


r(x) — (c.x" + terms of lower degree), 


which is a polynomial of degree lower than f, the degree of r(x). However, the polynomial 
in Eq. (1) can be written in the form 


F(x) — gx) ge) + (Cr/bm)x?™" J, 
so it is in S, contradicting the fact that r(x) was selected to have minimal degree in S. 


Thus the degree of r(x) is less than the degree m of g(x). 
For uniqueness, if 


f(x) = g(x)qi(x) + ri) 
and 
f(x) = g@ qa) + rox), 
then subtracting we have 
g(x)[gi(x) — g200)] = ra(x) — r(x). 
Because either r(x) — r,(x) = 0 or the degree of r2(x) — r1(x) is less than the degree 


of g(x), this can hold only if g(x) — q2(x) = 0 so qi (x) = qa(x). Then we must have 
r(x) — ry(x) = 0 sor (&) = ro(X). Sd 


We can compute the polynomials g(x) and r(x) of Theorem 23.1 by long division 
just as we divided polynomials in R[x] in high school. 
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23.2 Example Let us work with polynomials in Zs[x] and divide 


fx) = xt — 3x3 4203 +4x -1 


by g(®) = x? — 2x +3 to find g(x) and r(x) of Theorem 23.1. The long division should 
be easy to follow, but remember that we are in Zs[x], so, for example, 4x — (—3x) = 2x. 


2 


x°—x—3 
xt 2243] xt — 3x2 42x? 4 4x —1 
aM et 585) 
eit ie 
—x3 4+ 2x? — 3x 
— 3x? + 2x -1 
—3x7+ x-4 
x +3 
Thus 
g(x) =x? -—x-—3, and r(x) =x 43. A 


We give three important corollaries of Theorem 23.1. The first one appears in high 
school algebra for the special case F[x] = R[x]. We phrase our proof in terms of the 
mapping (homomorphism) approach described in Section 22. 


23.3 Corollary (Factor Theorem) An element a é€ F isa zero of f(x) € F [x] if and only if x — a is 
a factor of f(x) in F[v]. 


Proof Suppose that for a € F we have f(a)=0. By Theorem 23.1, there exist g(x), 
r(x) € Fx] such that 


f(x) = (& — a)g(x) + r(x), 


where either r(x) = 0 or the degree of r(x) is less than 1. Thus we must have r(x) =c 
for c € F, so 


fQ)=@-agi)t+e. 

Applying our evaluation homomorphism, @, : F[x] — F of Theorem 22.4, we find 
0 = f(a) = 0q(@) +c, 

so it must be that c = 0. Then f(x) = (x — a)q(x), so x — a isa factor of f(x). 


Conversely, if x — a is a factor of f(x) in F[x], where a € F, then applying our 
evaluation homomorpohism ¢, to f(x) = (x — a)q(x), wehave f(a) = O0g(a)=0. 
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23.4 Example Working again in Zs[x], note that 1 is a zero of 
(xt + 3x3 + 2x + 4) € Zs{x]. 


Thus by Corollary 23.3, we should be able to factor x4 + 3x3 +2x +4 into (x — Dax) 
in Zs[x]. Let us find the factorization by long division. 


x8 + 4x7 +4x +1 


x—1| x44 3x07 4+ 2x +4 


go 
4x3 
4x3 — 4x? 
4x? + 2x 
4x? — 4x 
x +4 
x—1 


0 


Thus x? + 3x? + 2x +4 = (x — 1)(x? + 4x7 + 4x + 1) in Zs[x]. Since 1 is seen to be 
a zero of x2 + 4x2 + 4x +1 also, we can divide this polynomial by x — 1 and get 


r+4 


x—1| x2 +47 +441 


et x2 


0 +4x*4+1 
4x — 4 
0 


Since x2 + 4 still has 1 as a zero, we can divide again by x — | and get 


x+1 


x—1] x? +4 


wx 


Thus x4 + 3x3 +2x+4= —1¥%@4+ 1D in Zs[r]. A 


The next corollary should also look familiar. 


23.5 Corollary A nonzero polynomial f(x) ¢ F[x] of degree n can have at most n zeros in a field F. 


Proof 


23.6 Corollary 


Proof 
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The preceding corollary shows that if a; € F is a zero of f(x), then 
F(x) = (% — aq), 


where, of course, the degree of g:(x) ism — 1. A zero a2 € F of q(x) then results in a 
factorization 


f(x) = & — ay)(X — a2)ga(x). 
Continuing this process, we arrive at 
f(x) = (& — ay) ++ @ — ar )ar&), 


where g,(x) has no further zeros in F. Since the degree of f(x) is n, at most n factors 
(x — a;) can appear on the right-hand side of the preceding equation, sor <n. Also, if 
b#a;fori=1,---,randb e¢ F, then 


f(b) = (6 — a1) ++ (8 — a, )Gr(b) F 9, 


since F has no divisors of 0 and none of b — a; or q-(b) are 0 by construction. Hence 
the a; fori = 1,---,r <n are all the zeros in F of f(x). . 


Our final corollary is concerned with the structure of the multiplicative group F* of 
nonzero elements of a field F, rather than with factorization in F'[x]. It may at first seem 
surprising that such a result follows from the division algorithm in F'[x], but recall that 
the result that a subgroup of a cyclic group is cyclic follows from the division algorithm 
in Z. 

If Gis a finite subgroup of the multiplicative group (F*, -) of a field F, then G is cyclic. 
In particular, the multiplicative group of all nonzero elements of a finite field is cyclic. 


By Theorem 11.12 as a finite abelian group, G is isomorphic to a direct product Zg, x 
Za x +++ x Zg,, where each d; is a power of a prime. Let us think of each of the Zz, asa 
cyclic group of order d; in multiplicative notation. Let m be the least common multiple 
of all the d; fori = 1,2,--+-,7; note that m < d,d2--+d,. Ifa; € Zg,, then ey =1,s0 
a; = 1 since d; divides m. Thus for all a € G, we have w” = 1, so every clement of 
G is zero of x™ — 1. But G has d,d2---d, elements, while x” — 1 can have at most m 
zeros in the field F by Corollary 23.5, so m > dd, ---d,. Hence m = djd2-+-d,, so 
the primes involved in the prime powers dj, dz, ---, d, are distinct, and the group G is 
isomorphic to the cyclic group Zn. 


Exercises 5 through 8 ask us to find all generators of the cyclic groups of units for 
some finite fields. The fact that the multiplicative group of units of a finite field is cyclic 
has been applied in algebraic coding. 


Irreducible Polynomials 


Our next definition singles out a type of polynomial in F[x] that will be of utmost 
importance to us. The concept is probably already familiar. We really are doing high 
school algebra in a more general setting. 
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23.7 Definition 


23.8 Example 


23.9 Example 


23.10 Theorem 


Proof 


Rings and Fields 


A nonconstant polynomial f(x) € F[x]is irreducible over F or is an irreducible poly- 
nomial in F [x] if f(x) cannot be expressed as a product g(x)h(x) of two polynomials 
g(x) and A(x) in F[x] both of lower degree than the degree of f(x). If f(x) € F[x] 
is a nonconstant polynomial that is not irreducible over F’, then f(x) is reducible 
over F. L_| 


Note that the preceding definition concerns the concept irreducible over F and not 
just the concept irreducible. A polynomial f(x) may be irreducible over F, but may not 
be irreducible if viewed over a larger field E containing F’. We illustrate this. 


Theorem 22.11 shows that x2 — 2 viewed in Q[x] has no zeros in Q. This shows that 
x? — 2 is irreducible over Q, for a factorization x? —2 = (ax + b)(cx + d) for a, b,c, 
d € Q would give rise to zeros of x? —2 in Q. However, x” — 2 viewed in R[x] is not 
irreducible over R, because x? — 2 factors in R[x] into (x — J2)x + /2). A 


Itis worthwhile to remember that the units in F [x] are precisely the nonzero elements 
of F. Thus we could have defined an irreducible polynomial f(x) as a nonconstant 
polynomial such that in any factorization f(x) = g(x)h(x) in F[x], either g(x) or h(x) 
is a unit. 


Let us show that f(x) = x*?+ 3x +2 viewed in Zs[x] is irreducible over Zs. If x3 + 
3x -+ 2 factored in Zs[x] into polynomials of lower degree then there would exist at 
least one linear factor of f(x) of the form x — a for some a € Zs. But then f(a) would 
be 0, by Corollary 23.3. However, f(0) = 2, fC) = 1, f(-l) = —2, f (2) = 1, and 


* f(—2) = —2, showing that f(x) has no zeros in Zs. Thus f(x) is irreducible over 


Zs. This test for irreducibility by finding zeros works nicely for quadratic and cubic 
polynomials over a finite field with a small number of elements. A 


Irreducible polynomials will play a very important role in our work from now on. 
The problem of determining whether a given f(x) € F [x] is irreducible over F may be 
difficult. We now give some criteria for irreducibility that are useful in certain cases. 
One technique for determining irreducibility of quadratic and cubic polynomials was 
illustrated in Examples 23.8 and 23.9. We formalize it in a theorem. 


Let f(x) € F[x], and let f(x) be of degree 2 or 3. Then f(x) is reducible over F if and 
only if it has a zero in F’. 


If f(x) is reducible so that f(x) = g(x)hA(@), where the degree of g(x) and the degree of 
h(x) are both less than the degree of f(x), then since f(x) is cither quadratic or cubic, 
either g(x) or h(x) is of degree 1. If, say, g(x) is of degree 1, then except for a possible 
factor in F, g(x) is of the form x — a. Then g(a) = 0, which implies that f(a) = 0, so 
f(x) has a zero in F. 

Conversely, Corollary 23.3 shows that if f(a) = 0 fora ¢ F, then x —a is a factor 
of f(x), so f(x) is reducible. ¢ 


We turn to some conditions for irreducibility over Q of polynomials in Q[x]. The 
most important condition that we shall give is contained in the next theorem. We shall 
not prove this theorem here; it involves clearing denominators and gets a bit messy. 


23.11 Theorem 


Proof 
23.12 Corollary 


Proof 


23.13 Example 


23.14 Example 


23.15 Theorem 
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If f(x) € Z[x], then f(x) factors into a product of two polynomials of lower degrees 
r and s in Q[y] if and only if it has such a factorization with polynomials of the same 
degrees r and s in Z[x]. 


The proof is omitted here. 5 


If f@)=x"4+ Gn—1x" ! +--+ +4 is in Z[x] with ap 4 0, and if f(x) has a zero in 
Q, then it has a zero m in Z, and m must divide ap. 


If f(x) has a zero a in Q, then f(x) has a linear factor x — a in Q[x] by Corollary 23.3. 
But then by Theorem 23.11, f(x) has a factorization with a linear factor in Z[x], so for 
some m € Z we must have 


fey=(- m)(x"—" tree ag/m). 


Thus dg/m is in Z, so m divides dp. ° 


Corollary 23.12 gives us another proof of the irreducibility of x? — 2 over Q, for x? — 2 
factors nontrivially in Q[x] if and only if it has a zero in Q@ by Theorem 23.10. By 
Corollary 23.12, it has a zero in Q if and only if it has a zero in Z, and moreover the only 
possibilities are the divisors +1 and +2 of 2. A check shows that none of these numbers 
is a zero of x? — 2. A 


Let us use Theorem 23.11 to show that 
; f(x) =x4 2x? 48x41 


viewed in Q[x] is irreducible over Q. If f(x) has a linear factor in Q{x], then it has a 
zero in Z, and by Corollary 23.12, this zero would have to be a divisor in Z of 1, that is, 
either +1. But f(1) = 8, and f(—1) = —8, so such a factorization is impossible. 

If f(x) factors into two quadratic factors in Q[x], then by Theorem 23.11, it has a 
factorization. 


(x? +ax + b\(x? +x 4d) 
in Z[x]. Equating coefficients of powers of x, we find that we must have 
bd=1, adt+bc=8, aet+tb+d=-2, and a+c=0 


for integers a, b,c,d € Z. From bd = 1, we see that either b = d = lorb=d=-—1. 
In any case, b =d and from ad + be = 8, we deduce that d(a + c) = 8. But this is 
impossible since a + c = 0. Thus a factorization into two quadratic polynomials is also 
impossible and f(x) is irreducible over Q. A 


We conclude our irreducibility criteria with the famous Eisenstein criterion for 
irreducibility. An additional very useful criterion is given in Exercise 37. 


(Eisenstein Criterion) Let p € Zbea prime. Suppose that f(x) = a,x" + --- + ao is 
in Z[x], and a, # 0 (mod p), but a; = 0 (mod p) for alli <n, with ap 4 0 (mod p’). 
Then f(x) is irreducible over Q. 
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Proof 


23.16 Example 


23.17 Corollary 


Proof 


Rings and Fields 


By Theorem 23.11 we need only show that f(x) does not factor into polynomials of 
lower degree in Z[x]. If 


F(x) = (b-x" +++ bo) (csx* +--+ +00) 


is a factorization in Z[x], with b, 4 0,c, # Oandr, s < n, then ay 4 0(mod p) implies 
that bo and cp are not both congruent to 0 modulo p. Suppose that bp # 0 (mod p) and 
co = 0 (mod p). Now a, # 0 (mod p) implies that b,, cs # 0 (mod p), since a, = br Cs. 
Let m be the smallest value of k such that c, # 0 (mod p). Then 


bmco ifr > m, 
Am = DoCm + Byem—-1 Fe + : 
bpCm—r fr < im. 
The fact that neither bp nor c», are congruent to 0 modulo p while c,_1, +++, co are all 
congruent to 0 modulo p implies that am # 0 modulo p, som =n. Consequently, s = n, 
contradicting our assumption that s <n; that is, that our factorization was nontrivial. 
. 4 


Note that if we take p = 2, the Eisenstein criterion gives us still another proof of 
the irreducibility of x? — 2 over Q. 


Taking p = 3, we see by Theorem 23.15 that 
25x5 — 9x4 — 3x” — 12 


“is irreducible over Q. A 


The polynomial 


xP —)1 
PGS x-l 


is irreducible over Q for any prime p. 


=xP 145? 2 4...4x41 


Again by Theorem 23.11, we need only consider factorizations in Z[x]. We remarked 
following Theorem 22.5 that its proof actually shows that evaluation homomorphims 
can be used for commutative rings. Here we want to use the evaluation homomor- 
phism ¢,4; : Qlx] > Q[x]. It is natural for us to denote oxsi(f(x)) by f@ + 1 for 
F(x) € QL]. Let 


P P\ .p-l 
Gairat + (2) +++ -b px 
@+1)-1 x ‘ 


gx) = O,@ +1) = 
The coefficient of x’~” for 0 <r < p is the binomial coefficient p!/[r!(p — r)!] which 


is divisible by p because p divides p! but does not divide either r! or (p — r)! when 
O0<r < p. Thus 


g(x) = xP T+ (f)st+- +p 


23.18 Theorem 


Proof 
23.19 Corollary 


Proof 


23.20 Theorem 


Proof 
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satisifies the Eisenstein criterion for the prime p and is thus irreducible over [. Burt 
® (x) = h(x)r(x) were a nontrivial factorization of p(x) in Z[x]. then 


®,%0+)=e@)=hot+ Drath 


would give a nontrivial factorization of g(x) in Z[x]. Thus ® ,(x) must also be irrenei7e 
over Q. Sa 


The polynomial ® , (x) in Corollary 23.17 is the p™ cyclotomic polynomial. 


Uniqueness of Factorization in F[x] 


Polynomials in F[x] can be factored into a product of irreducible polynomials in Fi] 
in an essentially unique way. For f(x), g(x) € F[x] we say that 9(x) divides f(x) in 
F [x] if there exists g(x) € F[x] such that f(x) = g(x)q¢(x). Note the similarity of the 
theorem that follows with boxed Property (1) for Z following Example 6.9. 


Let p(x) be an irreducible polynomial in F [x]. If p(x) divides r(x)s(x) for r@), s(x) € 
F [x], then either p(x) divides r(x) or p(x) divides s(x). 
We delay the proof of this theorem to Section 27. (See Theorem 27.27.) 5 


If p(x)is irreducible in F [x] and p(x) divides the product rj(x)- + - r(x) forr;(x) € Fx], 
then p(x) divides r;(x) for at least one 1. 


Using mathematical induction, we find that this is immediate from Theorem 23.18.  @ 
If Fisa field, then every nonconstant polynomial f(x) € F[x] can be factored in F[x] 


into a product of irreducible polynomials, the irreducible polynomials being unique 
except for order and for unit (that is, nonzero constant) factors in F’. 


Let f(x) € F[x] be a nonconstant polynomial. If f(x) is not irreducible, then f(x) = 
g(x)h(x), with the degree of g(x) and the degree of h(x) both less than the degree of f(x). 
If g(x) and A(x) are both irreducible, we stop here. If not, at least one of them factors 
into polynomials of lower degree. Continuing this process, we arrive at a factorization 


fe) = pi) pa(x)- ++ pr(%). 


where p;(x) is irreducible fori = 1,2,---,r. 
It remains for us to show uniqueness. Supposc that 


ff) = pi) p2(x) +++ pee) = qixga(®) ++ qs) 


are two factorizations of f(x) into irreducible polynomials. Then by Corollary 23.19, 
pi(x) divides some q;(x), let us assume g1(x). Since q(x) is irreducible, 


q(x) = ui pix), 


where u, 4 0, but uw; is in F and thus is a unit. Then substituting “1 p:(x) for gi(x) and 
canceling, we get 


D(x) +++ py(x) = U1 ga(X) +++ Gs(X). 
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By a similar argument, say g2(x) = uz p2(x), so 
p3(x)+-+ py(x) = uyurg3(x)- ++ gs(X). 
Continuing in this manner, we eventually arrive at 
1 = ug ++ Urgr4i(%)- + gs). 


This is only possible if s = r, so that this equation is actually 1] = uyu2---u,. Thus the 
irreducible factors p;(x) and qj(x) were the same except possibly for order and unit 
factors. Sd 


23.21 Example Example 23.4 shows a factorization of x* + 3x3 + 2x + 4 in Zs[x] is (x — 1P@ + D. 


These irreducible factors in Zs[x] are only unique up to units in Zs[x], that is, nonzero 
constants in Zs. For example, (x — 1° (x + 1) = (@ — 1)°(2x — 2)(x + 3). A 


EXERCISES 23 


Computations 


In Exercises 1 through 4, find g(x) and r(x) as described by the division algorithm so that f(x) = g(xq(x) +r) 
with r(x) = 0 or of degree less than the degree of g(x). 


L. f(x) = x8 + 3x5 +.4x? — 3x +2 and g(x) = x? + 2x —3 in Zy[x]. 
2 f(x) = x8 43x 44x* — 3x +2 and g(x) = 3x? + 2x —3 in Z,[x]. 
3. f(x) = 2° — 2x4 +3x — 5 and g(x) = 2x + 1in Zy [x]. 

A. f(x) = x4 4+. 5x9 — 3x? and g(x) = 5x? —x +2in Zy\ [x]. 


In Exercises 5 through 8, find all generators of the cyclic multiplicative group of units of the given finite field. 
(Review Corollary 6.16.) 


Zs 6. Z; ae ae 8. Zo; 


. The polynomial x* + 4 can be factored into linear factors in Zs[x]. Find this factorization. 
. The polynomial x3 + 2x? + 2x + 1 can be factored into linear factors in Z7[x]. Find this factorization. 
. The polynomial 2x3 + 3x2 — 7x — 5 can he factored into linear factors in Z;,[x]. Find this factorization. 


. Is x3 + 2x +3 an irreducible polynomial of Zs[x]? Why? Express it as a product of irreducible polynomials 


of Zs[x]. 


. Is 2x3 + x? + 2x +2 an irreducible polynomial in Z;[x]? Why? Express it as a product of irreducible poly- 


nomials in Zs[x]. 


. Show that f(x) = x? + 8x — 2 is irreducible over Q. Is f(x) irreducible over R? Over C? 
. Repeat Exercise 14 with g(x) = x? + 6x + 12 in place of f(x). 

. Demonstrate that x? + 3x? — 8 is irreducible over Q. 

17: 


Demonstrate that x4 — 22x? + 1 is irreducible over Q. 


In Exercises 18 through 21, determine whether the polynomial in Z[x] satisfies an Eisenstein criterion for irre- 
ducibility over Q. 
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18. x7 — 12 19, 8x? +.6x? — 9x +24 
20. 4x!° — 9x3 + 24x — 18 21, 2x!° — 25x? + 10x? — 30 


22. Find all zeros of 6x+ + 17x? + 7x? + x — 10 in Q. (This is a tedious high school algebra problem. You might 
use a bit of analytic geometry and calculus and make a graph, or use Newton’s method to see which are the 
best candidates for zeros.) 


Concepts 

In Exercises 23 and 24, correct the definition of the italicized term without reference to the text, if correction is 

needed, so that it is in a form acceptable for publication. 

23. A polynomial f(x) € F[x] is irreducible over the field F if and only if f(x) € g(x)A(x) for any polynomials 
g(x), A(x) € F[x]. 


24. A nonconstant polynomial f(x) € F [x] is irreducible over the field F if and only if in any factorization of it 
in F [x], one of the factors is in F. 


25. Mark each of the following true or false. 


a. x — 2 is irreducible over Q. 
b. 3x — 6 is irreducible over Q. 


____. ¢, x* — 3 is irreducible over Q. 

dd. x? + 3 is irreducible over Zy. 

___. e. If F is a field, the units of F[x] are precisely the nonzero elements of F’. 

f. If F is a field, the units of F[x] are precisely the nonzero elements of F. 

—__—. g. A polynomial f(x) of degree n with coefficients in a field F can have at most n zeros in F. 


—___—_h. A polynomial f(x) of degree n with coefficients in a field F can have at most n zeros in any given 
field E such that F < E. 


i, Every polynomial of degree 1 in F[x] has at least one zero in the field F. 
j. Each polynomial in F[x] can have at most a finite number of zeros in the field F. 


26. Find all prime numbers p such that x + 2 is a factor of x*# + x7 +x? -—x+1inZ,[x]. 

In Exercises 27 through 30, find all irreducible polynomials of the indicated degree in the given ring. 
27. Degree 2 in Zp[{x] 28. Degree 3 in Z)[x] 

29, Degree 2 in Zs[x] 30. Degree 3 in Z3[x] 


31. Find the number of irreducible quadratic polynomials in Z,[x], where p is a prime. [Hint: Find the number 
of reducible polynomials of the form x? + ax + b, then the number of reducible quadratics, and subtract this 
from the total number of quadratics.] 


Proof Synopsis 


32. Give a synopsis of the proof of Corollary 23.5. 
33. Give a synopsis of the proof of Corollary 23.6. 


Theory 
34, Show that for p a prime, the polynomial x? + a in Z,[x] is not irreducible for any a € Zp. 


35. If F is a field and a £0 is a zero of f(x) =a) tayx +---+a,x”" in F[x], show that 1/a is a zero of 
An + An—1X +--- + aox". 


aS °° °° °° © 
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36. (Remainder Theorem) Let f(x) € F [x] where F isa field, and leta € F. Show that the remainder r(x) when 
f(x) is divided by x — o, in accordance with the division algorithm, is f (a). 


37, Let on: 2 Zm be the natural homomorphism given by on(@) = (the remainder of a when divided by m) 
fora € Z. 
a. Show that Om : Ziv] > Zim {x] given by 
Gp(ao tax te + nx”) = Om(ao) + Fm(au)x + + Om(An)x" 
is a homomorphism of Z{x] onto Zm «1. 
b. Show that if f(x) € Z[x] and Gal f (2) both have degree n and Gf (x)) does not factor in Z,,[x] into two 
polynomials of degree less than n, then f(x) is irreducible in QL]. 


c. Use part (b) to show that x* + 17x + 36 is irreducible in Q{x]. (Aint: Try a prime value of m that simplifies 
the coefficients.] 


TNONCOMMUTATIVE EXAMPLES 


Thus far, the only example we have presented of a ring that is not commutative is the 
ring M,(F) of all n x n matrices with entries in a field F. We shall be doing almost 
nothing with noncommutative rings and strictly skew fields. To show that there are other 
important noncommutative rings occurring very naturally in algebra, we give several 
examples of such rings. 


Rings of Endomorphisms 


Let A be any abelian group. A homomorphism of A into itself is an endomorphism 


of A. Let the set of all endomorphisms of A be End(A). Since the composition of two 
homomorphisms of A into itself is again such a homomorphism, we define multiplication 
on End(A) by function composition, and thus multiplication is associative. 
To define addition, for ¢, yr € End(A), we have to describe the value of (@é + w) on 
each a € A. Define 
(p + wa) = O(a) + ¥@). 
Since 
@+ Wath) =o@+b)+ vat?) 
= [d(a) + 6) + (WO + vb)| 
= [o(a) + Wa) + [6@) + wb) 
=(¢+ Wat @+ HW) 
we see that @ + w is again in End(A). 
Since A is commutative, we have 
(g+W@ =O@+V@ = va) + $a) = +H) 


foralla€e A,sogt+V= w+¢ and addition in End(A) is commutative. The associa- 
tivity of addition follows from 


oe 


i This section is not used in the remainder of the text. 
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[6+ + 4)]@) = $(a) + [H + 4)(@)] 
= ¢(@) + [W(@) + 9a) 
= [b(a) + W(@)] + 6(@) 
=(@+ W@ + O@) 
=[+ ¥) + 4]@). 
If e is the additive identity of A, then the homomorphism 0 defined by 
O(a) =e 
for a € A is an additive identity in End(A). Finally, for 
@ € End(A), 
—@ defined by 
(—o)(a) = —¢(@) 
is in End(A), since 
(-o)(a + b) = —¢(a +b) = -16@) + 6)] 
= —$(a) — o(b) = (-6)(@) + (-6)®), 
and @ + (—@) = 0. Thus (End(A), +) is an abelian group. 

Note that we have not yet used the fact that our functions are homomorphisms except 
to show that ¢ + & and —¢@ are again homomorphisms. Thus the set AA of all functions 
from A into A is an abelian group under exactly the same definition of addition, and, 
of course, function composition again gives a nice associative multiplication in Af, 
However, we do need the fact that these functions in End(A) are homomorphisms now to 


prove the left distributive law in End(A). Except for this left distributive law, (A4, +, °) 
satisfies all the axioms for a ring. Let ¢, w, and @ be in End(A), and let a ¢ A. Then 


(6b + W)\a) = OCG + Ha) = 9(P(@) + Y@). 
Since @ is a homomorphism, 
A(b(a) + W(a)) = O(6(@)) + 9(W@)) 
= (66)(a) + Ow)a) 
= (09 + OY)(@). 
Thus 6(¢ + vw) = 0¢ + OW. The right distributive law causes no trouble, even in AA, 
and follows from 
(Cy + Oba) = (WW + OVO) = ¥(O@) + OO) 
= (Wo)(a) + (0O)a) = (Wb + O)@. 
Thus we have proved the following theorem. 


The set End(A) of all endomorphisms of an abelian group A forms a ring under homo- 
morphism addition and homomorphism multiplication (function composition). 


Again, to show relevance to this section, we should give an example showing that 
End(A) need not be commutative. Since function composition is in general not commu- 
tative, this seems reasonable to expect. However, End(A) may be commutative in some 
cases. Indeed, Exercise 15 asks us to show that End((Z, +)) is commutative. 


222 


Part IV 


24.2 Example 


24,3 Example 


Rings and Fields 


Consider the abelian group (Z x Z, +) discussed in Section 11. It is straightforward to 
verify that two elements of End((Z x Z, +)) are ¢ and y defined by 


o((m,n))=(m+n,0) and = (m,n) = (0, 0). 


Note that ¢ maps everything onto the first factor of Z x Z, and w collapses the first 
factor. Thus 


(Wo)(m, n) = (m +n, 0) = (©, 0). 
while 
(om, n) = G0, n) = @, 0). 
Hence oy 4 Wo. A 


Let F be a field of characteristic zero, and let (F [x], +) be the additive group of the 
ring F[x] of polynomials with coefficients in F. For this example, let us denote this 
additive group by F[x], to simplify this notation. We can consider End(F[x]). One 
element of End(F[x]) acts on each polynomial in F[x] by multiplying it by x. Let this 
endomorphism be X, so 


X (ao tax tax tet yx") = agx tayx? + ax? 4. + ayx"t!, 
Another element of End(F[x]) is formal differentiation with respect to x. (The familiar 
formula “the derivation of a sum is the sum of the derivatives” guarantees that differen- 
tiation is an endomorphism of F[x].) Let Y be this endomorphism, so 


¥ (ao + ax teagx® +++) + G_x") = ay + 2agx tee + nayx” |, 


Exercise 17 asks us to show that YX — XY = 1, where 1 is unity (the identity map) in 
End(F[x]). Thus XY #4 YX. Multiplication of polynomials in F[x] by any element of 
F also gives an element of End (F[x]). The subring of End(F'[x]) generated by X and Y 
and multiplications by elements of F is the Weyl algebra and is important in quantum 
mechanics. A 


Group Rings and Group Algebras 


Let G = {g; |i € 1} be any group written multiplicatively and let R be any commutative 
ring with nonzero unity. Let RG be the set of all formal sums. 


So aigi 

iel 
for a; € R and g; € G, where all but a finite number of the a; are 0. Define the sum of 
two elements of RG by 


(Dae) + (sae) = Sai + bidgi. 


iel ie! iel 
Observe that (a; + b;) = 0 except for a finite number of indices i, s0 Lic (a; + 5;) 8; 


is again in RG. It is immediate that (RG, +) is an abelian group with additive identity 
Lie 08i- 


24.4 Theorem 


24.5 Definition 


24.6 Example 


he 
7) 
ys) 
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Multiplication of two elements of RG is defined by the use of the multiplications 
in G and R as follows: 


Naively, we formally distribute the sum &;-;a;g; over the sum Dj;<;b;g; and rename a 
term a; g;byg, by ajbyg; where g;g, = g; in G. Since a; and b; are 0 for all but a finite 
number of 7, the sum Xs; gx=e)j x contains only a finite number of nonzero summands 
ajb, € R and may thus be viewed as an element of R. Again, at most a finite number of 
such sums Lg, o,~,4;b, are nonzero. Thus multiplication is closed on RG. 

The distributive laws follow at once from the definition of addition and the formal 
way we used distributivity to define multiplication. For the associativity of multiplication 


(Ze«)|(H%)(Hee)]-(Lee)]D(_F, o9)0| 


5 s onria) 


jel \ 8n8j8x=8i 


eiele~) 


Thus we have proved the following theorem. 


If G is any group written multiplicatively and R is a commutative ring with nonzero 
unity, then (RG, +, -) is aring. 


Corresponding to each g € G, we have an element lg in RG. If we identify (rename) 
1g with g, we see that (RG, -) can be considered to contain G naturally as a multiplicative 
subsystem. Thus, if G is not abelian, RG is not a commutative ring. 


The ring RG defined above is the group ring of G over R. If F is a field, then FG is 
the group algebra of G over F. | 


Let us give the addition and multiplication tables for the group algebra Z.G, where 
G = {e, a} is cyclic of order 2. The elements of Z.G are 


Oe +0a. Oetila, le+0a, and le+la. 
If we denote these elements in the obvious, natural way by 


0, a, e, and eta, 
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24.7 Table 


24.8 Table 


respectively, we get Tables 24.7 and 24.8. For example, to see that (e + a)(e +a) = 0, 


we have 


(le+la)\le+la)=04+De+04+ Da = 0e + 0a. 


This example shows that a group algebra may have 0 divisors. Indeed, this is usually the 


case. 


The Quaternions 


We have not yet given an example of a noncommutative division ring. The quaternions 


of Hamilton are the standard example of a strictly skew field; let us describe them. 


@ HIsTorICcAL NOTE 


S ir William Rowan Hamilton (1805-1865) dis- 
covered quaternions in 1843 while he was 
searching for a way to multiply number triplets 
(vectors in R*). Six years earlier he had devel- 
oped the complex numbers abstractly as pairs (a, b) 
of real numbers with addition (a,b) + (a+b) = 
(a+a',b+b') and multiplication (a, b)(a'b') = 
(aa’ — bb’, ab’ + a’b); he was then looking for an 
analogous multiplication for 3-vectors that was dis- 
tributive and such that the length of the product 
vector was the product of the lengths of the fac- 
tors. After many unsuccessful attempts to multiply 
vectors of the form a + bi + cj (where 1, i, 7 are 
mutually perpendicular), he realized while walking 


along the Royal Canal in Dublin on October 16, 
1843, that he needed a new “imaginary symbol” & 
to be perpendicular to the other three elements. He 
could not “resist the impulse ...to cut with a knife 
on a stone of Brougham Bridge” the fundamental 
defining formulas on page 225 for multiplying these 
quaternions. 

The quaternions were the first known exam- 
ple of a strictly skew field. Though many others 
were subsequently discovered, it was eventually 
noted that none were finite. In 1909 Joseph Henry 
Maclagan Wedderburn (1882-1948), then a precep- 
tor at Princeton University, gave the first proof of 
Theorem 24,10. 


Let the set H, for Hamilton, be R x R x Rx R. Now (Rx RxRxR,+) isa 
group under addition by components, the direct product of R under addition with itself 
four times. This gives the operation of addition on H. Let us rename certain elements of 
H. We shall let 


1=(1,0,0,0), i = (0,1, 0,0), 
j = (0,0,1,0), and k=(0,0,0, 1). 
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We furthermore agree to let 
a, = (a;,0,0,0), aot = (0, a2, 0, 0), 
a3j = (0,0,a3,0) and a4gk = (0, 0, 0, a4). 
Tn view of our definition of addition, we then have 
(41, 42, 43, 04) = A, + Qyi +43) + agk. 
Thus 
(a + azi +437 + agk) + (by + boi + b3j + bak) 
= (a, + by) + (a2 + b2)i + (a3 + b3)j + (ag + da)k. 
To define multiplication on H, we start by defining 
la=al=a for aceH, 
P= pP=P=-, 
and 
ij=k, jk=i, ki=j, ji=—k, kj=-i, and ik=-—j. 


Note the similarity with the so-called cross product of vectors. These formulas are easy 
to remember if we think of the sequence 


i, j,k, i, j,k. 
The product from left to right of two adjacent elements is the next one to the right. The 
product from right to left of two adjacent elements is the negative of the next one to the 


left. We then define a product to be what it must be to make the distributive laws hold, 
namely, 


(a, + agi + a3j + agk)(by + boi + b3j + bak) 
= (a,b; — agb2 — a3b3 — agbg) + (abo + aby + a3bq — a4b3)i 
+ (a1b3 — doba + a3b, + agbo)j 
+ (ayb4 + aob3 — a3b2 + agbpk. 


Exercise 19 shows that the quaternions are isomorphic to a subring of M2(C), so 
multiplication is associative. Since ij = k and ji = —k, we see that multiplication is 
not commutative, so H is definitely not a field. Turning to the existence of multiplicative 
inverses, let a = a, + agi + a3j + ak, with not all a; = 0. Computation shows that 


(a, + api + a3j + aak)(ay — Goi — a3j — a4k) = ay + ay + we + a;. 
If we let 


2 2 2 7 - : 
la? =a? t+aet+aet+a; and a =a, — doi — aaj — ak, 


Ga (3): (“)i- (45) 
ja? ja? Vay’ ae J? ae 


we see that 
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is a multiplicative inverse for a. We consider that we have demonstrated the following 
theorem. 


24.9 Theorem The quaternions H form a strictly skew field under addition and multiplication. 


Note that G = {+1, +i, +j, +k} is a group of order 8 under quaternion multiplica- 
tion. This group is generated by i and j, where 
Pat par end. jyiery, 


There are no finite strictly skew fields. This is the content of a famous theorem of 
Wedderburn, which we state without proof. 


24.10 Theorem (Wedderburn’s Theorem) Every finite division ring is a field. 


Proof See Artin, Nesbitt, and Thrall [24] for proof of Wedderburn’s theorem. 5 


@ EXERCISES 24 


Computations 


In Exercises 1 through 3, let G = {e, a, b} be a cyclic group of order 3 with identity element e¢. Write the element 
in the group algebra Z5G in the form 


re+sattb for r,5,t € Zs. 


1. Qe + 3a + Ob) + (4e + 2a 4+ 3b) 2. (2e + 3a + Ob)(4e + 2a + 3b) 3. Ge +3a + 3b) 
In Exercises 4 through 7, write the element of H in the form a; + a9i + 437 + ask fora; € R. 

4. §+3/)(44+2j —1 Cee a is 

6. +f)! 7. (1+ 347 +341 


8. Referring to the group $; given in Example 8.7, compute the product 
(Op9 + 1p, + Op2 + Op1 + lus + Les) po + Lei + 002 + Li + Cuz + 13) 
in the group algebra Z, $3. 


9, Find the center of the group (H*, -), where H* is the set of nonzero quaternions. 


Concepts 

10. Find two subsets of H different from C and from each other, each of which is a field isomorphic to C under 
the induced addition and multiplication from H. 

11. Mark each of the following true or false. 

a. M,,(F) has no divisors of 0 for any n and any field F. 

b. Every nonzero element of M2(Zz) is a unit. 

c. End(A) is always a ring with unity 4 0 for every abelian group A. 

d. End(A) is never a ring with unity # 0 for any abelian group A. 


12. 
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e. The subset Iso(A) of End(A), consisting of the isomorphisms of A onto A, forms a subring of 
End(A) for every abelian group A. 


____. f. R(Z, +) is isomorphic to (Z, +, -) for every commutative ring R with unity. 
_____ g. The group ring RG of an abelian group G is a commutative ring for any commutative ring R with 


unity. 


_____. h. The quaternions are a field. 


i. (H*, -) is a group where H* is the set of nonzero quaternions. 
j. No subring of H is a field. 
Show each of the following by giving an example. 


a. A polynomial of degree n with coefficients in a strictly skew field may have more than n zeros in the skew 
field. 


b. A finite multiplicative subgroup of a strictly skew field need not be cyclic. 


Theory 


13. 


14. 


15. 


16. 
17. 
18. 
19. 


Let @ be the element of End((Z x Z, +)) given in Example 24.2. That example showed that ¢ is a right divisor 
of 0. Show that ¢ is also a left divisor of 0. 


Show that M@>(F) has at least six units for every field F. Exhibit these units. [Hint: F has at least two elements, 
0 and 1.] 

Show that End ((Z, +-}) is naturally isomorphic to (Z, +, -) and that End({Z,, +)) is naturally isomorphic to 
(Zn, +, °)- 

Show that End((Z. x Zs, +)) is not isomorphic to (Z2 x Zo,+,-). 

Referring to Example 24.3, show that YX — XY = 1. 

If G = {e}, the group:of one element, show that RG is isomorphic to R for any ring R. 

There exists a matrix K € M>(C) such that ¢ : H — M,(C) defined by 


: ! 1 0 Oo 1 0 i 
d(a+ bit+cj +dk)=a E 4 +b E 1 tel} | +dK, 
for alla, b, c,d € R, gives an isomorphism of H with ¢[H] 
a. Find the matrix K. 


b. What 8 equations should you check to see that ¢ really is a homomorphism? 
¢c. What other thing should you check to show that ¢ gives an isomorphism of H with ¢[H]? 


tOrpERED RINGS AND FIELDS 


We are familiar with the inequality relation < on the set R and on any subset of R. (We 
remind you that relations were discussed in Section 0. See Definition 0.7.) We regard 
< as providing an ordering of the real numbers. In this section, we study orderings of 
rings and fields. We assume throughout this section that the rings under discussion have 
nonzero unity 1. 

In the real numbers, a < 6 if and only if b — a is positive, so the order relation < 
on R is completely determined if we know which real numbers are positive. We use the 
idea of labeling certain elements as positive to define the notion of order in a ring. 


t This section is not used in the remainder of the text. 
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25.1 Definition 


25.2 Example 


Rings and Fields 


An ordered ring is a ring R together with a nonempty subset P of R satisfying these 
two properties. 


Closure For all a, b € P, both a + b and ab are in P. 
Trichotomy For eacha € R, one and only one of the following holds: 


aéP, a=0, -aeP. 
Elements of P are called “positive.” a 


It is easy to see that if R is an ordered ring with set P of positive elements and S is 
a subring of R, then PO S satisfies the requirements for a set of positive elements in the 
ring S, and thus gives an ordering of S. (See Exercise 26.) This is the induced ordering 
from the given ordering of R. 

We observe at once that for each of the rings Z, Q and R the set of elements that we 
have always considered to be positive satisfies the conditions of closure and trichotomy. 
We will refer to the familiar ordering of these rings and the induced ordering on their 
subrings as the natural ordering. We now give an unfamiliar illustration. 


Let R be an ordered ring with set P of positive elements. There are two natural ways to 
define an ordering of the polynomial ring R[x]. We describe two possible sets, Plow and 
Phign, of positive elements. A nonzero polynomial in R[x] can be written in the form 


fx) = a,x" + pray ad fee tayx” 


where a, + 0 and a, 4 0, so that a,x’ and a,x” are the nonzero terms of lowest and 
highest degree, respectively. Let Pow be the set of all such f(x) for which a, ¢€ P, 
and let Prick be the set of all such f(x) for which a, € P. The closure and trichotomy 
requirements that Pioy and Phigh must satisfy to give orderings of R[x] follow at once from 
those same properties for P and the definition of addition and multiplication in R[x]. 
Illustrating in Z[x], with ordering given by Piow, the polynomial f(x) = —2x + 3x4 
would not be positive because —2 is not positive in Z. With ordering given by Phign, this 
same polynomial would be positive because 3 is positive in Z. A 


Suppose now that P is the set of positive elements in an ordered ring R. Let a be 
any nonzero element of R. Then either a or —a is in P, so by closure, a = (—-a)y is 
also in P. Thus all squares of nonzero elements of R are positive. In particular, 1 = 1? 
is positive. By closure, we see that 1 + 1 +---+ 1 for any finite number of summands 
is always in P, so it is never zero. Thus an ordered ring has characteristic zero. 

Because squares of nonzero elements must be positive, we see that the natural 
ordering of R is the only possible ordering. The positive real numbers are precisely the 
squares of nonzero real numbers and the set could not be enlarged without destroying 
trichotomy. Because 1+ 1+---+ 1 must be positive, the only possible ordering of 
Z is the natural ordering also. All ordered rings have characteristic zero so we can, by 
identification (renaming), consider every ordered ring to contain Z as an ordered subring. 

If a and b are nonzero elements of P then either —a or a is in P and either —b or 
bis in P. Consequently by closure, either ab or —ab is in P. By trichotomy, ab cannot 
be zero so an ordered ring can have no zero divisors. 

We summarize these observations in a theorem and corollary. 


25.3 Theorem 


25.4 Corollary 


25.5 Theorem 


Proof 
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Let R be an ordered ring. All squares of nonzero elements of R are positive. R has 
characteristic 0, and there are no zero divisors. 


We can consider Z to be embedded in any ordered ring R, and the induced ordering of 
Z from R is the natural ordering of Z. The only possible ordering of i& is the natural 
ordering. 


Theorem 25.3 shows that the field C of complex numbers cannot be ordered, because 
both 1 = 1? and —1 = i” are squares. It also shows that no finite ring can be ordered 
because the characteristic of an ordered ring is zero. 

The theorem that follows defines a relation < in an ordered ring, and gives properties 
of <. The definition of < is motivated by the observation that, in the real numbers, a < b 
if and only if b — a is positive. The theorem also shows that ordering could have been 
defined in terms of a relation < having the listed properties. 


Let R be an ordered ring with set P of positive elements. Let <, read “‘is less than,” be 
the relation on R defined by 


a <bifandonlyif(b-—a)eP (1) 
for a, b € R. The relation < has these properties for alla, b,c € R. 
Trichotomy One and only one of the following holds: 
a<b, a=b, b«<a. 
Transitivity Ifa <bandb <c,thena <e. 


sIsotonicity Ifb<c,thena+b<a-+te. 
If b < cand0 <a, thenab < ac and ba < ca. 


Conversely, given a relation < on a nonzero ring R satisfying these three conditions, 
the set P = {x € R|0 < x} satisfies the two criteria for a set of positive elements in 
Definition 25.1, and the relation <p defined as in Condition (1) with this P is the given 
telation <. 


Let R be an ordered ring with set P of positive elements, and leta < bmean(b—a) «€ P. 
We prove the three properties for <. 


Trichotomy Leta, b € R. By the trichotomy property of P in Definition 25.1 
applied to b — a, exactly one of 


(b-aveP, b-a=0, (a-byeP 
holds. These translate in terms of < to 
a<b, a=b, b«a 
respectively. 


Transitivity Leta < bandb <c.Then(b—a) € P and(c —b) € P. Byclo- 
sure of P under addition, we have 


(b-—a)+(c—b)=(c—ayeP 


sod <c. 


: 
: 
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Isotonicity Let b<c, so (c—b)¢ P. Then (a+c)—(a+b)=(c—be 
Psoa+b<a-+c. Also if a > 0, then by closure of P both 
a(c — b) =ac — aband(c — Da = ca — baarein P,soab < ac 
and ba < ca. 


We leave the “conversely” part of the theorem as an equally easy exercise. (See 
Exercise 27.) 5 


In view of Theorem 25.5, we will now feel free to use the < notation in an ordered 
ring. The notations >, <, and > are defined as usual in terms of < and =. Namely, 


b>ameansa <b, a< bmeanseithera = dora <b, 


a > bmeans either b < aorb=a. 


Let R be an ordered ring. It is illustrative to think what the orderings of R[x] given by 
Piow and Pyigh in Example 25.2 mean in terms of the relation < of Theorem 25.5. 

Taking Piow, we observe, for every a > Oin R, that a — x is positive sox < a. Also, 
x =x — Ois positive, so 0 < x. Thus 0 < x < a for every a € R. We have (x! — x/) € 
Prow when i < j,sox/ <x! ifi < j. Our monomials have the ordering 


O<- x cP axt cx ax? <x <a 


for any positive a € R. Taking R = R, we see that in this ordering of R[x] there are 
infinitely many positive elements that are less than any positive real number! 

We leave a similar discussion of < for the ordering of R[x] given by Phign to 
Exercise 1. A 


The preceding example is of interest because it exhibits an ordering that is not 
Archimedian. We give a definition explaining this terminology. Remember that we can 
consider Z to be a subring of every ordered ring. 


An ordering of a ring R with this property: 


For each given positive a and b in R, there exists a positive integer n such that 
na > b, 


is an Archimedian ordering. a 


The natural ordering of R is Archimedian, but the ordering of R[x] given by Prow 
discussed in Example 25.6 is not Archimedian because for every positive integer n we 
have (17 — nx) € Pow, sonx < 17 foralln € Zt. 

We give two examples describing types of ordered rings and fields that are of interest 
in more advanced work. 


(Formal Power Series Rings) Let R be a ring. In Section 22 we defined a polynomial 
in R[x] to be a formal sum ear a;x' where all but a finite number of the a; are 0. If 
we do not require any of the a; to be zero, we obtain a formal power series in x with 
coefficients in the ring R. (The adjective, formal, is customarily used because we are not 
dealing with convergence of series.) Exactly the same formulas are used to define the 
sum and product of these series as for polynomials in Section 22. Most of us had some 
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practice adding and multiplying series when we studied calculus. These series form a 
ring which we denote by R[[x]], and which contains R[x] as a subring. 

If R is an ordered ring, we can extend the ordering to R[[x]] exactly as we extended 
the ordering to R[x] using the set Pio, of positive elements. (We cannot use Phich. Why 
not?) The monomials have the same ordering that we displayed in Example 25.6. A 


(Formal Laurent Series Fields) Continuing with the idea of Example 25.8, we let F 
be a field and consider formal series of the form )>;~ y aix' where N may be any integer, 
positive, zero, or negative, and a; € F. (Equivalently, we could consider )°7"_,, aix' 
where all but a finite number of the a; are zero for negative values of i. In studying calcu- 
lus for functions of a complex variable, one encounters series of this form called “Laurent 
series.”) With the natural addition and multiplication of these series, we actually have a 
field which we denote by F ((x)). The inverse of x is the series x! +0 + Ox + 0x7 +--+. 
Inverses of elements and quotients can be computed by series division. We compute three 
terms of (@7! — 1 +x —x?2 4x3 +-.)/(c? + 2x4 4+ 3x5 +--+ +) in R((x)) for illustra- 
tion. 


x4 — 3x3 4 4x74... 


xt-14 x-e txete 
x1 4+24+3x4+ -- 


42x48 + 3x7 405: 


Boys a8 
~3—6x — 9x? +... 
4x + we hy 


If F is an ordered field, we can use the obvious analog of Piow in R[[x]] to define 
an ordering of F((x)). In Exercise 2 we ask you to symbolically order the monomials 
eee 3 2 xt 9 = 1, x, x2, x3, --- as we did for R[x] in Example 25.6. Note that 
F((x)) contains, as a subfield, a field of quotients of F [x], and thus induces an ordering 
on this field of quotients. A 


Let R be an ordered ring and let @ : R > R’ be aring isomorphism. It is intuitively 
clear that by identification (renaming), the map ¢ can be used to carry over the ordering 
of R to provide an ordering of R’. We state as a theorem what would have to be proved 
for a skeptic, and leave the proof as Exercise 25. 


Let R be an ordered ring with set P of positive elements and let 6: R > R’ be aring 
isomorphism. The subset P’ = $[P] satisfies the requirements of Definition 25.1 for a 
set of positive elements of R’. Furthermore, in the ordering of R’ given by P’, we have 
o(a) <' &(b) in R’ if and only if a < bin R. 


We call the ordering of R’ described in the preceding theorem the “ordering induced 
by” ¢ from the ordering of R. 
Example 22.9 stated that the evaluation homomorphism ¢, : Q[x] — R where 
(ao + ax + +++ + anx") = ag bam +++ aye" 
is one to one. Thus it provides an isomorphism of Q[x] with d[Q[x]]. We denote this 
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image ring by Q[z]. If we provide Q[x] with the ordering using the set Pioy of Ex- 
amples 25.2 and 25.6, the ordering on Q[z] induced by ¢, is very different from that 
induced by the natural (and only) ordering of R. In the P,oy, ordering, 7 is less than any 
element of Q! A 


An isomorphism of a ring & onto itself is called an automorphism of R. Theo- 
rem 25.10 can be used to exhibit different orderings of an ordered ring R if there exist 
automorphisms of R that do not carry the set P of positive elements onto itself. We give 
an example. 


Exercise 11 of Section 18 shows that {m + nJ2 |m,n € Z} isa ring. Let us denote this 
ring by Z[/2]. This ring has a natural order induced from R in which J/2 is positive. 
However, we claim that : ZV2] > Zi /2} defined by o(m + nV2) =m —nv2isan 
automorphism. It is clearly one to one and onto ZIV2]. We leave the verification of the 
homomorphism property to Exercise 17. Because o(./2) = —/2, we see the ordering 
induced by ¢ will be one where —4/2is positive! In the natural order on Z{V2], anelement 
m +n,/2 is positive if m and n are both positive, or if m is positive and 2n? < m?, orif 
n is positive and m” < 2n*. In Exercise 3, we ask you to give the analogous descriptions 
for positive elements in the ordering of Z{V2] induced by ¢. A 


In view of Examples 25.11 and 25.12, which exhibit orderings on subrings of R that 
are not the induced orderings, we wonder whether Q can have an ordering other than the 
natural one. Our final theorem shows that this is not possible. 


Let D be an ordered integral domain with P as set of positive elements, and let F be a 


‘field of quotients of D. The set 


P'={x € F|x =a/b fora, b € Dandab e€ P} 


is well-defined and gives an order on F that induces the given order on D. Furthermore, 
P’ is the only subset of F with this property. 


To show that P’ is well-defined, suppose that x = a/b = a’/b’ for a, b, a’, b' € D and 
that ab ¢ P. We must show that a’b’ € P. From a/b = a'/b' we obtain ab’ = a’'b. 
Multiplying by b, we have (ab)b’ = a'b*. Now b? € P and by assumption, ab € P. 
Using trichotomy and the properties a(—b) = (—a)b = —(ab) of a ring, we see that 
either a’ and D’ are both in P or both not in P. In either case, we have a’b’ € P. 

We proceed to closure for P’. Let x = a/b and y = c/d be two elements of P’, so 
ab € Pandcd € P.Nowx + y = (ad + bc)/bd and (ad + be)bd = (ab)d* + b?(cd) 
is in P because squares are also in P and P is closed under addition and multiplication. 
Thus (x + y) € P’. Also xy = ac/bd is in P’ because achd = (ab)(cd) is a product of 
two elements of P and thus in P. 

For trichotomy, we need only observe that for x = a/b, the product ab satisfies just 
one of 


abe P, ab =0, ab¢P 
by trichotomy for P. For P’, these translate intox € P’, x =0,andx ¢ P’, respectively. 
We have shown that P’ does give an ordering of F’. Fora € D, we see thata = a/1 


is in P’ if and only if al =a is in P, so the given ordering on D is indeed the induced 
ordering from F by P’. 
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Finally, suppose that P” is a set of positive elements of F satisfying the conditions 
of Definition 25.1 and such that P” M D = P.Letx = a/b € P” wherea, b € D. Then 
xb? = ab must be in P”, so ab € (P” MD) = P. Thus x € P’ so P” C P’. The law of 
trichotomy shows that we then must have P’ = P”. Therefore P’ gives the only ordering 
of F that maintains original order for elements of D. e 


m@ EXERCISES 25 


Computations 


1. Let R be an ordered ring. Describe the ordering of a positive element a of R and the monomials x, Sane ees tee 
in R[x] as we did in Example 25.6, but using the set Phigh Of Example 25.6 as set of positive elements of R[x]. 


2. Let F be an ordered field and let F((x)) be the field of formal Laurent series with coefficients in F’, discussed in 
Example 25.9. Describe the ordering of the monomials --- x73, x7?, x7! x° = 1, x, x?, x°, --- in the ordering 
of F((x)) described in that example. 


3. Example 25.12 described an ordering of Z[V2] = {m +nV2|m,n € Z} in which —V2 is positive. Describe, 
in terms of m and n, all positive elements of Z[./2] in that ordering. 


In Exercises 4 through 9, let R[x] have the ordering given by 
i. Piow ii. Phish 


as described in Example | 25. 2. In each case (i) and (ii), list the labels a, b, c, d, e of the given polynomials in an 
order corresponding to increasing order of the polynomials as described by the relation < of Theorem 25.5. 


4. a, —54 3x b. 5 — 3x c. —x + 7x? d. x — 7x? e. 2+ 4x? 

5. a. —1 b. 3x — 8x3 ce. —5x+7x2-—A1xt ) di 8x? 4° e. —3x3 — 4x9 
6. a. —3 + 5x? b. -2x + 5x? +27 & -5 d. 6x? + 8x4 e. 8x4 —5x° 
Toa, —2x7 4+5x3 b. x7 + 4x4 ce. 2x — 3x? d. —3x — 4x? e. 2x — 2x? 

8. a. 4x — 3x? b. 4x + 2x? c. 4x — 6x? d. 5x — 6x3 e. 3x — 2x? 

9 a. x —3x?+5x7 b. 2—3x*+5x3 c. x — 3x* 4+ 4x3 dg. x 43x74 4x4 e@. x 43x? -— 433 


In Exercises 10 through 13, let Q((x)) have the ordering described in Example 25.9. List the labels a, b, c, d, e of 
the given elements in an order corresponding to increasing order of the elements as described by the relation < of 
Theorem 25.5. 


1 —5 2 —3 
10. a —- b= a — d. os ei 
x x x x 
1 2 1 —Xx 3-2. 
ll. a bee c. de peace wil 
1—x l+x x — x2 1+x x3 44x 
5-7 —24+4 7+2 — 3x? 3-5 
22. igo apes rig ii . 


a. ———~ . ——— ‘ Reena e, ———_ 
x2 + 3x3 4 — 3x 4 — 3x * 24 6x —6+2x 
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Concepts 
14, It can be shown that the smallest subfield of R containing </2 is isomorphic to the smallest subfield of C 
containing 4 it ). Explain why this shows that, although there is no ordering for C, there may be an 
ordering of a subfield of C that contains some elements that are not real numbers. 
15. Mark each of the following true or false. 
a. There is only one ordering possible for the ring Z. 
b. The field R can be ordered in only one way. 
c, Any subfield of R can be ordered in only one way. 
d. The field Q can be ordered in only one way. 
e. If R is an ordered ring, then R[x] can be ordered in a way that induces the given order on R. 
____ f, Anordering of aring R is Archimedian if for each a, b € R, there existsn € Z* such that b < na. 
g. An ordering of a ring R is Archimedian if for each a, b € R such that 0 <a, there exists n € Zt 
such that b < na. 
h. If R is an ordered ring and a € R, then —a cannot be positive. 
i. If R is an ordered ring and a € R, then either a or —a is positive. 
j. Every ordered ring has an infinite number of elements. 
16. Describe an ordering of the ring Q[z], discussed in Example 25.11, in which mw is greater than any rational 
number. 
Theory 
17. Referring to Example 25.12, show that the map ¢ : ZV2] — R where d(m + nJ/2) =m —nV2 is ahomo- 


morphism. 


In Exercises 18 through 24, let R be an ordered ring with set P of positive elements, and let < be the relation on 
R defined in Theorem 25.5. Prove the given statement. (All the proofs have to be in terms of Definition 25.1 and 
Theorem 25.5. For example, you must not say, “We know that negative times positive is negative, so ifa < Oand 
0 < bthen ab < 0.”) 


18. 
19. 
20. 
21. 
22. 
23. 
24. 
25. 
26. 


27. 


Ifa € P, then 0 < a. 

Ifa, b € P and ac = bd, then either c = d =Oored € P. 
Ifa < b, then —b < —a. 

Ifa < Oand0 < db, then ab < 0. 

If R is a field and a and b are positive, then a/b is positive. 
If R is a field andO < a < 1, then1 < 1/a. 

If R isa field and —1 < a < 0, then 1/a < —-1. 

Prove Theorem 25.10 of the text. 


Show that if R is an ordered ring with set P of positive elements and S is a subring of R, then PM S satisfies 
the requirements for a set of positive elements in the ring S, and thus gives an ordering of S. 


Show that if < is a relation on a ring R satisfying the properties of trichotomy, transitivity, and isotonicity 
stated in Theorem 25.5, then there exists a subset P of R satisfying the conditions for a set of positive elements 


28. 


29. 
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in Definition 25.1, and such that the relation <p defined by a <p b if and only if (6 — a) € P is the same as 
the relation <. 


Let R be an ordered integral domain. Show that if a?"+! — 5?"+! where a, b € R and n is a positive integer, 
then a = b. 


Let R be an ordered ring and consider the ring R[x, y] of polynomials in two variables with coefficients in R. 
Example 25.2 describes two ways in which we can order R[x], and for each of these, we can continue on and 
order (R[x])[y] in the analogous two ways, giving four ways of arriving at an ordering of R[x, y]. There are 
another four ways of arriving at an ordering of R[x, y] if we first order R[y] and then (R[y])[x]. Show that 
all eight of these orderings of R[x, y] are different. [Hint: You might start by considering whether x < y or 
y < x in each of these orderings, and continue in this fashion.] 
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HoMOMORPHISMS AND Factor RINGS 


Homomorphisms 


We'defined the concepts of homomorphism and isomorphism for rings in Section 18, 
since we wished to talk about evaluation homomorphisms for polynomials and about 
isomorphic rings. We repeat some definitions here for easy reference. Recall that a 
homomorphism is a structure-relating map. A homomorphism for rings must relate both 
their additive structure and their multiplicative structure. 


A map ¢ of aring R into a ring R’ is a homomorphism if 
ola + b) = ofa) + b(b) 
and 
b(ab) = o(a)b(b) 
for all elements a and b in R. a 
In Example 18.10 we defined evaluation homomorphisms, and Example 18.11 
showed that the map ¢: Z— Z,, where @(m) is the remainder of m when divided 


by n, is ahomomorphism. We give another simple but very fundamental example of a 
homomorphism. 


(Projection Homomorphisms) Let R,, R2,---, Ry» be rings. For each i, the map 7; : 
R, xX Rp X +++ x Ry, > R; defined by 7; (rj, r2, +--+, 7.) = 7; is a homomorphism, pro- 
jection onto the ith component. The two required properties of a homomorphism hold 


1 Section 28 is not required for the remainder of the text. 
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for 7; since both addition and multiplication in the direct product are computed by 
addition and multiplication in each individual component. A 


Properties of Homomorphisms 


We work our way through the exposition of Section 13 but for ring homomorphisms. 


(Analogue of Theorem 13.12) Let ¢ be ahomomorphism of a ring R into a ring Rif 
0 is the additive identity in R, then (0) = 0’ is the additive identity in R’, andifa < R, 
then ¢(—a) = —@(a). If S is a subring of R, then $[S] is a subring of R’. Going the 
other way, if S’ is a subring of R’, then ¢7!{[S’] is a subring of R. Finally, if R has unity 
1, then #(1) is unity for @[R]. Loosely speaking, subrings correspond to subrings, and 
rings with unity correspond to rings with unity under a ring homomorphism. 


Let @ be a homomorphism of a ring R into a ring R’. Since, in particular, @ can be 
viewed as a group homomorphism of (R, +) into (R’, +’), Theorem 13.12 tells us that 
~(0) = 0’ is the additive identity element of R’ and that ¢(—a) = —$(@). 

Theorem 13.12 also tells us that if § is a subring of R, then, considering the additive 
group (S, +), the set (@[S], +’) gives a subgroup of (R’, +’). If ¢(s)) and (sz) are two 
elements of @[.5], then 


(81 )b(s2) = O(5152) 


and (5152) € @[S]. Thus 6(s,)@(s2) € [1S], so O[S] is closed under multiplication. 
Consequently, @[S] is a subring of R’. 

Going the other way, Theorem 13.12 also shows that if 5’ is a subring of R’, then 
(@—'[S’}, +) isa subgroup of (R, +). Leta, b € ~'{S’], sothat O(a) € S’ and@(b) € S’. 
Then 

dab) = $(a)gd). 


Since o(a)@(b) € S’, we see that ab € ¢[S’] so @—1[8’] is closed under multiplication 
and thus is a subring of R. 
Finally, if R has unity 1, then for allr € R, 


br) = br) = () = (DO) = oe), 
so @(1) is unity for d[R]. 5 


Note in Theorem 26.3 that @(1) is unity for @[R], but not necessarily for R’ as we 
ask you to illustrate in Exercise 9, 


Let amap @ : R > R be a homomorphism of rings. The subring 
o'[0'] = fr € RIG) =0} 
is the kernel of ¢, denoted by Ker(¢). |_| 


Now this Ker(@#) is the same as the kernel of the group homomorphism of (R, +) 
into (R’, +) given by ¢. Theorem 13.15 and Corollary 13.18 on group homomorphisms 
give us at once analogous results for ring homomorphisms. 
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(Analogue of Theorem 13.15) Let @: R — R’ be a ring homomorphism, and let 
H = Ker(o). Let a € R. Then 6 | [6(a)] =a +H =A+a, whrea+H=H+a 
is the coset containing a of the commutative additive group (H, +}. 


(Analogue of Corollary 13.18) A ring homomorphism ¢ : R — R’ is a one-to-one 
map if and only if Ker(¢) = {O}. 


Factor (Quotient) Rings 


We are now ready to describe the analogue for rings of Section 14. We start with the 
analogue of Theorem 14.1. 


(Analogue of Theorem 14.1) Let@: R — R’ bearing homomorphism with kernel H. 
Then the additive cosets of H forma ring R/H whose binary operations are defined by 
choosing representatives. That is, the sum of two cosets is defined by 


(@+H)+(6+H)=@+b)+H, 
and the product of the cosets is defined by 
(a+ H)(b + H) = (ab) + H. 
Also, the map w : R/H — ¢[R] defined by w(a + H) = (a) is an isomorphism. 


Once again, the additive part of the theory is done for us in Theorem 14.1. We proceed 
to check the multiplicative aspects. 

We must first show that multiplication of cosets by choosing representatives is well 
defined. To this end, let #;, h2, € H and consider the representatives a +h, of a+ H 
and b+ hz of b+ H. Let 


c= (a + hy)(b + hz) =ab+ ahy +h,b + hyho. 


We must show that this element c lies in the cosetab + H.Sinceab+ H = 67 ![(ab)], 
we need only show that @(c) = ¢(ab). Since ¢ is a homomorphism and ¢(h) = 0’ for 
h € H, we obtain 


O(c) = (ab + ahy + hb + hyh2) 
= G(ab) + d(ahz) + b(yb) + (hy hz) 
= o(ab) + o(a)0' + 0'6(b) + 0'0' 
= o(ab) + 0'+0'+0' = o(ab). (1) 


Thus multiplication by choosing representatives is well defined. 

To show that R/H is a ring, it remains to show that the associative property for 
multiplication and the distributive laws hold in R/H. Since addition and multiplica- 
tion are computed by choosing representatives, these properties follow at once from 
corresponding properties in R. 

Theorem 14.1 shows that the map yu defined in the statement of Theorem 26.4 is well 
defined, one to one, onto ¢[R], and satisfies the additive property for a homomorphism. 
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Multiplicatively, we have 
ula + H)(b + H)| = wlab+ A) = d(ab) 
= $(a)b(0) = wa + H)ulb + A). 


This completes the demonstration that jz is an isomorphism. ° 


Example 18.11 shows that the map ¢ : Z — Z, defined by @(m) = r, where r is the re- 
mainder of m when divided by n, is ahomomorphism. Since Ker(@) = nZ, Theorem 26.7 
shows that Z/nZ is aring where operations on residue classes can be computed by choos- 
ing representatives and performing the corresponding operation in Z. The theorem also 
shows that this ring Z/nZ is isomorphic to Z,. A 


It remains only to characterize those subrings H of a ring R such that multipli- 
cation of additive cosets of H by choosing representatives is well defined. The coset 
multiplication in Theorem 26.7 was shown to be well defined in Eq. (1). The success of 
Eq. (1) is due to the fact that @(ah2) = @(hib) = b(hyh2) = O'. That is, if h € A where 
AT = Ker(@), then for every a, b € R we have ah € H andhb € H. This suggests Theo- 
rem 26.9 below, which is the analogue of Theorem 14.4. 


(Analogue of Theorem 14.4) Let H be a subring of the ring R. Multiplication of 
additive cosets of H is well defined by the equation 


(a+ H)\(b+H)=ab+H 


if and only ifah € H andhb € A foralla,be Randhe H. 


Suppose first thatah € H andhb € H foralla,b € Randallh € H.Lethi, ho € Hso 
that a +h, andb + hz are also representatives of the cosetsa + H andb+ AH containing 
a and b. Then 


(ath))(b+h.) =ab+aho+hb+hyho. 


Since ahz and h)b and hyh> are allin H by hypothesis, we see that (a + h1)(b + ho) € 
ab+dH. 

Conversely, suppose that multiplication of additive cosets by representatives is well 
defined. Let a € R and consider the coset product (a + H)H. Choosing representatives 
a é€(a+4H) and 0 € H, we see that (a+ H)H =a0+ 4H =0+H4H —H. Since we 
can also compute (a4 + H)H by choosing a € (a+ H) and any h € H, we see that 
ah € H for any h € H. A similar argument starting with the product H(b + H) shows 
that hb € H foranyh € H. 4 


In group theory, normal subgroups are precisely the type of substructure of groups 
required to form a factor group with a well-defined operation on cosets given by operating 
with chosen representatives. Theorem 26.9 shows that in ring theory, the analogous 
substructure must be a subring H of a ring R such that aH C H and HbC H for 
alla, b € R, whereaH = {ah|h € H} and Hb = {hb|h € HA}. From now on we will 
usually denote such a substructure by N rather than H. Recall that we started using N 
to mean a normal subgroup in Section 15. 
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26.10 Definition An addive subgroup N of aring R satisfying the properties 
aNCoN and NbCN foralla,be R 
is an ideal. | 
26.11 Example We see that Z is an ideal in the ring Z since we know it is a subring, and s(nm) = 
(nm)s = n(ms) € nZ for alls € Z. A 
26.12 Example Let F be the ring of all functions mapping R into R, and let C be the subring of F 
consisting of all the constant functions in F’. Is C an ideal in F'? Why? 
Solution tis not true that the product of a constant function with every function is again a constant 


function. For example, the product of sin x and 2 is the function 2 sin x. Thus C is not 


an ideal of F. 


w@ HistoricaL NOTE 


[ was Ernst Eduard Kummer (1810-1893) who 
introduced the concept of an “ideal complex 
number” in 1847 in order to preserve the notion 
of unique factorization in certain rings of alge- 
braic integers. In particular, Kummer wanted to 
be able to factor into primes numbers of the form 
dg + ayo + anat? +--+ apa", where @ is a 
complex root of x? = 1 (p prime) and the a; are 
ordinary integers. Kummer had noticed that the 
naive definition of primes as “unfactorable num- 
bers” does not lead to the expected results; the prod- 
uct of two such “unfactorable” numbers may well 
be divisible by other “unfactorable” numbers. Kum- 
mer defined “ideal prime factors” and “ideal num- 
bers” in terms of certain congruence relationships; 
these ‘ideal factors” were then used as the divisors 


necessary to preserve unique factorization. By use 
of these, Kummer was in fact able to prove cer- 
tain cases of Fermat’s Last Theorem, which states 
that x7 + y” = z” has no solutions x, y,z € Z* if 
n>2. 

It turned out that an “ideal number,” which was 
in general not a “number” at all, was uniquely de- 
termined by the set of integers it “divided.” Richard 
Dedekind took advantage of this fact to identify the 
ideal factor with this set; he therefore called the set 
itself an ideal, and proceeded to show that it satis- 
fied the definition given in the text. Dedekind was 
then able to define the notions of prime ideal and 
product of two ideals and show that any ideal in the 
ring of integers of any algebraic number field could 
be written uniquely as a product of prime ideals. 


26.13 Example Let F be as in the preceding example, and let N be the subring of all functions f such 
that f(2) = 0. Is N an ideal in F? Why or why not? 
Solution Let f € Nandletg € F.Then(fg)(2) = f(2)g(2) = 0g(2) = 0, so fg € N. Similarly, 


we find that gf € N. Therefore N is an ideal of F. We could also have proved this by 
just observing that N is the kernel of the evaluation homomorphism ¢2:F > R. A 
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26.14 Corollary 


26.15 Definition 


26.16 Theorem 


Proof 


26.17 Theorem 


Proof 
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Once we know that multiplication by choosing representatives is well defined on 
additive cosets of a subring N of R, the associative law for multiplication and the 
distributive laws for these cosets follow at once from the same properties in R. We have 
at once this corollary of Theorem 26.9. 


(Analogue of Corollary 14.5) Let N be an ideal of aring R. Then the additive cosets 
of N forma ring R/N with the binary operations defined by 
(a+N)+(+N)=(at+tb)+N 
and 
(a+N\b+N)=ab+N. 


The ring R/N in the preceding corollary is the factor ring (or quotient ring) of R 
by N. | 


If we use the term quotient ring, be sure not to confuse it with the notion of the field 
of quotients of an integral domain, discussed in Section 21. 


Fundamental Homomorphism Theorem 


To complete our analogy with Sections 13 and 14, we give the analogues of Theorems 14.9 
and 14.11. 


(Analogue of Theorem 14.9) Let N be an ideal of aring R. Theny : R ~ R/N given 


. by y(x) = x +N isa ring homomorphism with kernel N. 


The additive part is done in Theorem 14.9. Turning to the multiplicative question, we 
see that 


yxy) = (xy) +N = (+N) +N) =y@)r(y). ¢ 


(Fundamental Homomorphism Theorem; Analogue of Theorem 14.11) Let ¢: 
R= R’ be a ring homomorphism with kernel N. Then $[R] is a ring, and the map 
wi R/N > 6[R]givenby ua +N) = (x) isanisomorphism. Ify : R > R/N isthe 
homomorphism given by y(x) =x +N, then for each x € R, we have (x) = py (x). 


This follows at once from Theorems 26.7 and 26.16. Figure 26.18 is the analogue of 
Fig. 14.10. Sd 


R > o[R] 


R/N 


26.18 Figure 
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26.19 Example Example 26.11 shows that nZ is an ideal of Z, so we can form the factor ring 2 nZ. 
Example 18.11 shows that ¢ : Z — Z, where ¢(m) is the remainder of m modulo n is a 
homomorphism, and we see that Ker(@) = nZ. Theorem 26.17 then shows that the map 
uw: Z/nZ > Z, where (m + nZ) is the remainder of m modulo n is well defined and 
is an isomorphism. A 


In summary, every ring homomorphism with domain R gives rise to a factor ring 
R/N, and every factor ring R/N gives rise to a homomorphism mapping R into RN. 
An ideal in ring theory is analogous to a normal subgroup in the group theory. Both are 
the type of substructure needed to form a factor structure. 

We should now add an addendum to Theorem 26.3 on properties of homomorphisms. 
Let ¢ : R — R’ be a homomorphism, and let N be an ideal of R. Then ¢[N] is an ideal 
of @[R], although it need not be an ideal of R’. Also, if N’ is an ideal of either @[R] or 
of R’, then @~'{N’"] is an ideal of R. We leave the proof of this to Exercise 22. 


@ EXERCISES 26 


Computations 
1. Describe all ring homomorphisms of Z x Z into Z x Z. [Hint: Note that if ¢ is such a homomorphism, then 
(CA, 0)) = 61, 0))P(CL, 0) and G((O, 1F)) = G(CO, 1))G(O, 1). Consider also (1, 0), 1)).] 
2. Find all positive integers n such that Z, contains a subring isomorphic to Z. 


3. Find all ideals N of Z,2. In each case compute Z12/N; that is, find a known ring to which the quotient ring is 
isomorphic. : 


4. Give addition and multiplication tables for 2Z/8Z. Are 2Z/8Z and Z, isomorphic rings? 


Concepts 
In Exercises 5 through 7, correct the definition of the italicized term without reference to the text, if correction is 
needed, so that it is in a form acceptable for publication. 

5. An isomorphism of a ring R with aring R’ is a homomorphism ¢ : R — R’ such that Ker(¢) = {0}. 


6. Anideal N of aring R is an additive subgroup of (R, +) such that for allr € R andalln € N, wehavern € N 
andnr EN. 


7. The kernel of a homomorphism @ mapping a ring R into aring R’ is {@(r) = 0’ |r € R}. 


8. Let F be the ring of all functions mapping R into R and having derivatives of all orders. Differentiation gives 
amap 6: F > F where 6(f(x)) = f(x). Is 5 a homomorphism? Why? Give the connection between this 
exercise and Example 26.12. 


9. Give an example of a ring homomorphism @ : R > R’ where R has unity 1 and (1) 4 0’, but @(1) is not 
unity for R’. 


10. Mark each of the following true or false. 


a. The concept of a ring homomorphism is closely connected with the idea of a factor ring. 
b. A ring homomorphism ¢ : R — R’ carries ideals of R into ideals of R’. 


ce. A ring homomorphism is one to one if and only if the kernel is {0}. 
d. Q is an ideal in R. 
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11. 


12. 
13. 
14. 
15. 
16. 


Part V_ Ideals and Factor Rings 


_____ e. Every ideal in a ring is a subring of the ring. 

______ f, Every subring of every ring is an ideal of the ring. 

______g, Every quotient ring of every commutative ring is again a commutative ring. 

______ h. The rings Z/4Z and Z, are isomorphic. 

i. An ideal N in aring R with unity 1 is all of R if and only ifle N. 

j. The concept of an ideal is to the concept of a ring as the concept of a normal subgroup is to the 
concept of a group. 

Let R be a ring. Observe that {0} and R are both ideals of R. Are the factor rings R/R and R/{0} of real 

interest? Why? 


Give an cxample to show that a factor ring of an integral domain may be a field. 

Give an example to show that a factor ring of an integral domain may have divisors of 0. 

Give an example to show that a factor ring of a ring with divisors of 0 may be an integral domain. 
Find a subring of the ring Z x Z that is not an ideal of Z x &. 


A student is asked to prove that a quotient ring of a ring R modulo an ideal NV is commutative if and only if 
(rs — sr) € N forallr,s € R. The student starts out: 

Assume R/N is commutative. Then rs = sr for allr,s € R/N. 

a. Why does the instructor reading this expect nonsense from there on? 

b. What should the student have written? 

c. Prove the assertion. (Note the “if and only if.”) 


Theory 


17, 


18. 
19. 


20. 


21. 


22. 


23. 


24, 
25. 


Let R = (a+ bvZ a,b €Z) andlet R consist of all 2 x 2matrices of the form [? 7%] fora, b € Z. Show 
that R is a subring of R and that R’ is a subring of M)(Z). Then show that ¢ : R — R’, where o(a + b./2) = 
FE: ma is an isomorphism. 

Show that cach homomorphism from a field to a ring is either one to one or maps everything onto 0. 


Show that if R, R’, and R” are rings, and if @: R > R’ and y: R’ > R" are homomorphisms, then the 
composite function wv : R > R” is ahomomorphism. (Use Exercise 49 of Section 13.) 


Let R be a commutative ring with unity of prime characteristic p. Show that the map ¢, : R > R given by 
g,(a) = a? isa homomorphism (the Frobenius homomorphism). 


Let R and R’ be rings and let ¢ : R — R’ be aring homomorphism such that @[R] 4 {0’}. Show that if R has 
unity 1 and R’ has no 0 divisors, then (1) is unity for R’. 


Let @: R > R’ be ating homomorphism and let NV bé an ideal of R. 
a. Show that @[N] is an ideal of @[R]. 


b. Give an example to show that [NV] need not be an ideal of R’. 
c. Let N’ be an ideal either of @[R] or of R’. Show that 7 |[N’] is an ideal of R. 


Let F be a field, and let S be any subset of F x F x--- x F for n factors. Show that the set Ns of all 
fw,+++,%n) € Fle: Xn| that have every element (a1, ---, dn) of Sasa zero (see Exercise 28 of Section 22) 
is an ideal in F[j, ---, X,]. This is of importance in algebraic geometry. 


Show that a factor ring of a field is either the trivial (zero) ring of one element or is isomorphic to the field. 
Show that if R is a ring with unity and N is an ideal of K such that N ~ R, then R/N is a ring with unity. 


26. 


27, 


28. 


29. 


30. 


31. 


32. 


33. 


34. 


35. 


36. 


37. 


38. 
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Let R be a commutative ring and let a € R. Show that J, = {x € R| ax = 0} is an ideal of R. 
Show that an intersection of ideals of a ring R is again an ideal of R. 


Let R and R’ be rings and let N and N’ be ideals of R and R’, respectively. Let @ be a homomorphism of R 
into R’. Show that @ induces a natural homomorphism ¢, : R/N — R'/N’if 6[N] © N’. Use Exercise 39 of 
Section 14.) 

Let ¢ be a homomorphism of a ring R with unity onto a nonzero ring R’. Let wu be a unit in R. Show that (1) 
is a unit in R’. 

An element a of a ring R is nilpotent if a” = 0 for some n € Z*. Show that the collection of all nilpotent 
elements in a commutative ring R is an ideal, the nilradical of R. 


Referring to the definition given in Exercise 30, find the nilradical of the ring Z)2 and observe that it is one of 
the ideals of Z2 found in Exercise 3. What is the nilradical of Z? of Z32? 


Referring to Exercise 30, show that if N is the nilradical of a commutative ring R, then R/N has as nilradical 
the trivial ideal {O + N}. 


Let R be a commutative ring and N an ideal of R. Referring to Exercise 30, show that if every element of N 
is nilpotent and the nilradical of R/N is R/N, then the nilradical of R is R. 


Let R be a commutative ring and N an ideal of R. Show that the set \/N of all a € R, such that a” € N for 
some n € Z*, is an ideal of R, the radical of N. 


Referring to Exercise 34, show by examples that for proper ideals NV of a commutative ring R, 
a. VN need not equal N b. //N may equal N. 


What is the relationship of the ideal VN of Exercise 34 to the nilradical of R/N (see Exercise 30)? Word your 
answer carefully. 


Show that @ : C — Mp(R) given by 


pla + bi) = iS 2 


for a, b € R gives an isomorphism of C with the subring ¢[C] of M@2(R). 


Let R be a ring with unity and let End((R,+)) be the ring of endomorphisms of (R,-+) as described in 
Section 24. Leta € R, and let A, : R > R be given by 


Ag(x) = ax 
forx € R. 


a. Show that A, is an endomorphism of (R, +). 
b. Show that R’ = {A, [a € R} is a subring of End((R, +)). 
c. Prove the analogue of Cayley’s theorem for R by showing that R’ of (b) is isomorphic to R. 


PRIME AND MAXIMAL IDEALS 


Exercises 12 through 14 of the preceding section asked us to provide examples of factor 
rings R/N where R and R/N have very different structural properties. We start with 
some examples of this situation, and in the process, provide solutions to those exercises. 


27.1 Example As was shown in Corollary 19.12, the ring Z,, which is isomorphic to Z/pZ, is a field 
for p a prime. Thus a factor ring of an integral domain may be a field. A 
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27.2 Example 


27.3 Example 


27.4 Example 


27.5 Theorem 


Proof 


27.6 Corollary 


Proof 


Ideals and Factor Rings 


The ring Z x Z is not an integral domain, for 
(0, 11, 0) = (0, 0), 


showing that (0, 1) and (1, 0) are O divisors. Let NV = {(0, n)|[n € Z}. Now N isan ideal of 
Z x Z, and (Z x Z)/N is isomorphic to Z under the correspondence [(m, 0) + N] = m, 
where m € Z. Thus a factor ring of a ring may be an integral domain, even though the 
original ring is not. A 


The subset N = {0, 3} of Ze is easily seen to be an ideal of Ze, and Ze/N has three 
elements, 0+ N,1+N, and 24+ N. These add and multiply in such a fashion as to 
show that Z./N ~ Zs under the correspondence 


(0+ N) <0, Gd+N)< 1, Q+N) <2. 


This example shows that if R is not even an integral domain, that is, if R has zero divisors, 
it is still possible for R/N to be a field. A 


Note that Z is an integral domain, but Z/6Z ~ Ze is not. The preceding examples 
showed that a factor ring may have a structure that seems beter than the original ring. 
This example indicates that the structure of a factor ring may seem worse than that of 
the original ring. A 


Every nonzero ring R has at least two ideals, the improper ideal R and the trivial 
ideal {0}. For these ideals, the factor rings are R/R, which has only one element, and 


«R/{0}, which is isomorphic to R. These are uninteresting cases. Just as for a subgroup 


of a group, a proper nontrivial ideal of a ring R is an ideal N of R such that N #R 
and N + {0}. 

While factor rings of rings and integral domains may be of great interest, as the 
above examples indicate, Corollary 27.6, which follows our next theorem, shows that a 
factor ring of a field is really not useful to us. 


If R is aring with unity, and N is an ideal of R containing a unit, then N = R. 


Let N be an ideal of R, and suppose that u € N for some unit u in R. Then the condition 
rN CN forallr € R implies, if we taker = u-landu € N, that] = u-'wisin N. But 
thenrN CN for ally € R implies thatr1 =r isin N forallr e R,soN = R., Sd 


A field contains no proper nontrivial ideals. 
Since every nonzero element of a field is a unit, it follows at once from Theorem 27.5 


that an ideal of a field F is either {0} or all of F. + 


Maximal and Prime Ideals 


We now take up the question of when a factor ring is a field and when it is an integral 
domain. The analogy with groups in Section 15 can be stretched a bit further to cover 
the case in which the factor ring is a field. 


27.7 Definition 


27.8 Example 


27.9 Theorem 


Proof 


27.10 Example 


27.11 Corollary 
Proof 
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A maximal ideal of a ring R is an ideal M different from R such that there is no proper 
ideal N of R properly containing M. a 


Let p be a prime positive integer. We know that Z/pZ is isomorphic to Z,. Forgetting 
about multiplication for the moment and regarding Z/pZ and Z, as additive groups, 
we know that Z, is a simple group, and consequently pZ must be a maximal normal 
subgroup of Z by Theorem 15.18. Since Z is an abelian group and every subgroup is a 
normal subgroup, we see that pZ is a maximal proper subgroup of Z. Since pZ is an 
ideal of the ring Z, it follows that pZ is a maximal ideal of Z. We know that Z/pZ is 
isomorphic to the ring Z,, and that Zp is actually a field. Thus Z/pZ is a field. This 
illustrates the next theorem. A 


(Analogue of Theorem 15.18) Let R be a commutative ring with unity. Then M is a 
maximal ideal of R if and only if R/M is a field. 


Suppose M is a maximal ideal in R. Observe that if R is a commutative ring with 
unity, then R/M is also a nonzero commutative ring with unity if Mf 4 R, which is 
the case if M is maximal. Let (a + M) € R/M, with a ¢ M, so that a+ M is not 
the additive identity element of R/M. Suppose a + M has no multiplicative inverse 
in R/M. Then the set(R/M)(a + M) = {7 + M\(a+ M)|(7 + M) © R/M} does not 
contain 1 + M. We easily see that (R/M)(a + M) is an ideal of R/M. It is nontrivial 
because a ¢ M, and it is a proper ideal because it does not contain 1+ M. By the 
final paragraph of Section 26, if y : R > R/M is the canonical homomorphism, then 
y~1[(R/M)(a + M)]is a proper ideal of R properly containing M. But this contradicts 
our assumption that M is a maximal ideal, so a + 44 must have a multiplicative inverse 
in R/M. 

Conversely, suppose that R/M is a field. By the final paragraph of Section 26, if 
N is any ideal of R such that MC N Cc Rand y is the canonical homomorphism of R 
onto R/M, then y[N] is an ideal of R/M with {0 + M)} Cc yLN] C R/M. But this is 
contrary to Corollary 27.6, which states that the field R/M contains no proper nontrivial 
ideals. Hence if R/M isa field, M is maximal. Af 


Since Z/nZ is isomorphic to Z, and Z, is a field if and only if n is a prime, we see that 
the maximal ideals of Z are precisely the ideals pZ for prime positive integers p. A 


A commutative ring with unity is a field if and only if it has no proper nontrivial ideals. 


Corollary 27.6 shows that a field has no proper nontrivial ideals. 
Conversely, if a commutative ring R with unity has no proper nontrivial ideals, then 
{0} is a maximal ideal and R/{0}, which is isomorphic to R, is a field by Theorem 27.9. 
a 


We now turn to the question of characterizing, for a commutative ring R with unity. 
the ideals N + R such that R/N is an integral domain. The answer here is rather obvious. 
The factor ring R/N will be an integral domain if and only if (a+ N)\(b+N)=N 
implies that either 


a+N=N or b+N=N. 


27.12 Example 


27.13 Definition 


27.14 Example 


27.15 Theorem 


27.16 Corollary 


Proof 


Ideals and Factor Rings 


This is exactly the statement that R/N has no divisors of 0, since the coset N plays 
the role of 0 in R/N. Looking at representatives, we see that this condition amounts to 
saying that ab € N implies that either a € N or bEN. 


All ideals of Z are of the form nZ. Forn = 0, we have nZ = {0}, and Z/{0} ~ Z, which 
is an integral domain. For n > 0, we have Z/nZ ~ Z, and Z, is an integral domain if 
and only if n is a prime. Thus the nonzero ideals nZ such that Z/nZ is an integral domain 
are of the form pZ, where p is a prime. Of course, Z/pZ is actually a field, so that pZ 
is a maximal ideal of Z. Note that for a product rs of integers to be in pZ, the prime p 
must divide either r or s. The role of prime integers in this example makes the use of the 
word prime in the next definition more reasonable. A 


Anideal N # R ina commutative ring R is a prime ideal if ab € N implies that either 
aéeNorbe Nfora,beR. | 


Note that {0} is a prime ideal in Z, and indeed, in any integral domain. 


Note that Z x {0} is a prime ideal of Z x Z, for if (a, b)(c, d) € Z x {0}, then we must 
have bd = 0 in Z. This implies that either b = 0 so (a, b)€Zx {0}ord =Oso(c,d)€ 
Z x {0}. Note that (Z x Z)/(Z x {0}) is isomorphic to Z, which is an integral domain. 

A 


Our remarks preceding Example 27.12 constitute a proof of the following theorem, 
which is illustrated by Example 27.14. 


Let R be a commutative ring with unity, and let N # R be an ideal in R. Then R/N is 
an integral domain if and only if N is a prime ideal in R. 


Every maximal ideal in a commutative ring R with unity is a prime ideal. 


If M is maximal in R, then R/M is a field, hence an integral domain, and therefore M 
is a prime ideal by Theorem 27.15. Sa 


The material that has just been presented regarding maximal and prime ideals is 
very important and we shall be using it quite a lot. We should keep the main ideas well 
in mind. We must know and understand the definitions of maximal and prime ideals and 
must remember the following facts that we have demonstrated. 


For a commutative ring R with unity: 
1. Anideal M of R is maximal if and only if R/M isa field. 
2. Anideal N of R is prime if and only if R/N is an integral domain. 


3. Every maximal ideal of R is a prime ideal. 


27.17 Theorem 


Proof 


27.18 Corollary 


Proof 


27.19 Theorem 


Proof 


Section 27 Prime and Maximal Ideals 249 


Prime Fields 


We now proceed to show that the rings Z and Z,, form foundations upon which all rings 
with unity rest, and that Q and Z, perform a similar service for all fields. Let R be any 
ring with unity 1. Recall that by 1-1 we mean 1+ 1+.--+-+1 for n summands for 
n> 0, and (-1)+ (-D+.---+(—D) for |n| summands for n < 0, while n - 1 = 0 for 
n=0. 


If R is aring with unity 1, then the map ¢ : Z — R given by 
d(n)=n-l 
forn € Zis a homomorphism of Z into R. 
Observe that 
on +m) =(nt+m)-1=(1-)+(m-1)= 6) + om). 
The distributive laws in R show that 


Gepl se ieee) SD apes Lys 
—_—$—>= —s>s_ eer es eee” —__-_ 


n summands m summands nm summands 


Thus (n-1)(m-1) = (nm) - 1 forn, m > 0. Similar arguments with the distributive 
laws show that for alln, m € Z, we have 


(n-1)(m-1)=(m)-1. 
Thus 


o(nm) = (nm) - 1 = (n+ 1)(m - 1) = 6) b(n). ¢ 


If R is aring with unity and characteristic n > 1, then R contains a subring isomorphic 
to Z,. If R has characteristic 0, then R contains a subring isomorphic to Z. 


The map @ : Z— R given by ¢(m) = m- 1 for m € Zis a homomorphism by Theo- 
rem 27.17. The kernel must be an ideal in Z. All ideals in Z are of the form sZ for some 
s € Z. By Theorem 19.15 we see that if R has characteristic n > 0, then the kernel of 
isnZ. Then the image @[Z] < Ris isomorphic to Z/nZ ~ Z,,. if the characteristic of R 
is 0, then m- 1 4 0 for all m = 0, so the kernel of ¢ is {0}. Thus, the image ¢[Z] < R 
is isomorphic to Z. ¢ 


A field F is either of prime characteristic p and contains a subfield isomorphic to Z, or 
of characteristic 0 and contains a subfield isomorphic to Q. 


If the characteristic of F is not 0, the above corollary shows that F contains a subring 
isomorphic to Z,. Then n must be a prime p, or F would have 0 divisors. If F is of 
characteristic 0, then F must contain a subring isomorphic to Z. In this case Corollaries 
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27.20 Definition 


27.21 Definition 


27.22 Example 


27.23 Example 


27,24 Theorem 


Proof 
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21.8 and 21.9 show that F must contain a field of quotients of this subring and that this 
field of quotients must be isomorphic to Q. . 


Thus every field contains either a subfield isomorphic to Z, for some prime p ora 
subfield isomorphic to Q. These fields Z, and Q are the fundamental building blocks on 
which all fields rest. 


The fields Z, and Q are prime fields. a 


Ideal Structure in F [x] 


Throughout the rest of this section, we assume that F is a field. We give the next definition 
for a general commutative ring R with unity, although we are only interested in the case 
R = F[x]. Note that for a commutative ring R with unity anda € R, theset {ra|r € R} 
is an ideal in R that contains the element a. 


If R is a commutative ring with unity and a € R, the ideal {ra|r € R} of all multiples 
of a is the principal ideal generated by a and is denoted by (a). Anideal N of Risa 
principal ideal if N = (a) for some a € R. | 


Every ideal of the ring Z is of the form 2Z, which is generated by n, so every ideal of Z 
is a principal ideal. A 


The ideal (x) in F[x] consists of all polynomials in F[x] having zero constant term. 
A 


The next theorem is another simple but very important application of the division al- 
gorithm for F[x]. (See Theorem 23.1.) The proof of this theorem is to the division 
algorithm in F[x] as the proof that a subgroup of a cyclic group is cyclic is to the 
division algorithm in Z. 


If F isa field, every ideal in F [x] is principal. 


Let N be an ideal of F[x]. If N = {0}, then N = (0). Suppose that N # {0}, and let g(x) 
be a nonzero element of N of minimal degree. If the degree of g(x) is 0, then g@) € F 
andis aunit, so N = F[x] = (1) by Theorem 27.5, so N is principal. If the degree of g(x) 
is >1, let f(x) be any element of N. Then by Theorem 23.1, f(x) = ge(x)g) +rx), 
where r(x) = 0 or (degree r(x)) < (degree g(x)). Now f(x) EN and g(x) € N imply 
that f(x) — g(x)q(x) = r(x) is in N by definition of an ideal. Since g(x) is a nonzero 
element of minimal degree in N, we must have r(x) = 0. Thus f(x) = g(x)q(x) and 
N = (g(x)). 


We can now characterize the maximal ideals of F[x]. This is a crucial step in 
achieving our basic goal: to show that any nonconstant polynomial f(x) in F[x] has a 
zero in some field E containing F. 


27.25 Theorem 


Proof 


27.26 Example 


27.27 Theorem 


Proof 
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An ideal (p(x)) 4 {0} of F[x] is maximal if and only if p(x) is irreducible over F. 


Suppose that (p(x)) 4 {0} isa maximal ideal of F[x]. Then (p(x)) 4 F[x],so p(x) € F. 
Let p(x) = f(x)g(x) be a factorization of p(x) in F[x]. Since (p(x)) is a maximal 
ideal and hence also a prime ideal, (f(x)g(x)) € (p(x)) implies that f(x) € (p(x): or 
g(x) € (p(x)); that is, either f(x) or g(x) has p(x) as a factor. But then we can’t have 
the degrees of both f(x) and g(x) less than the degree of p(x). This shows that p(x) is 
irreducible over F. 

Conversely, if p(x) is irreducible over F, suppose that N is an ideal such that 
(p(x)) CN C F[x]. Now N is a principal ideal by Theorem 27.24, so N = (g(x)) for 
some g(x) € N.Then p(x) € N implies that p(x) = g(x)g(x) forsome q(x) € F[x]. But 
p(x) is irreducible, which implies that either g(x) or g(x) is of degree 0. If g(x) is of degree 
O, that is, a nonzero constant in F, then g(x) is a unit in F[x], so (g(x)) = N = F[x]. If 
q(x) is of degree 0, then g(x) = c, where c € F, and g(x) = (1/c)p(x) is in (p(x)), so 
N = (p(x)). Thus (p(x)) C N Cc F[x] is impossible, so (p(x)) is maximal. Ad 


Example 23.9 shows that x3 + 3x +2 is irreducible in Zs[x], so Zs[x]/(x? + 3x + 2) 
is a field. Similarly, Theorem 22.11 shows that x” — 2 is irreducible in Q[x], so Q[x]/ 
(x? — 2) is a field. We shall examine such fields in more detail later. A 


Application to Unique Factorization in F[x] 


In Section 23, we stated without proof Theorem 27.27, which follows. (See Theo- 
rem 23.18.) Assuming this theorem, we proved in Section 23 that factorization of poly- 
nomials in F[x] into irreducible polynomials is unique, except for order of factors and 
units in F. We delayed the proof of Theorem 27.27 until now since the machinery we 
have developed enables us to give such a simple, four-line proof. This proof fills the gap 
in our proof of unique factorization in F[x]. 


Let p(x) be an irreducible polynomial in F[x]. If p(x) divides r(x)s(x) for r(x), s(x) € 
F [x], then either p(x) divides r(x) or p(x) divides s(x). 


Suppose p(x) divides r(x)s(x). Then r(x)s(x) € (p(x)}, which is maximal by Theo- 


rem 27.25. Therefore, (p(x)) is a prime ideal by Corollary 27.16. Hence r(x)s(x) € 


{p(x)) implies that either r(x) € (p(x)), giving p(x) divides r(x), or that s(x) € (p(x)), 
giving p(x) divides s(x). . 


A Preview of Our Basic Goal 


We close this section with an outline of the demonstration in Section 29 of our basic 
goal. We have all the ideas for the proof at hand now; perhaps you can fill in the details 
from this outline. 

Basic goal: Let F be a field and let f(x) be a nonconstant polynomial in F[x]. 
Show that there exists a field E containing F and containing a zero a of f(x). 


eeeeessgsxgxy _ ©. 
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Outline of the Proof 


1. Let p(x) be an irreducible factor of f(x) in F[x]. 
2. Let E be the field F[x1/(p(x)). (See Theorems 27.25 and 27.9.) 


3, Show that no two different elements of F are in the same coset of F[x]/( p(x)), 
and deduce that we may consider F to be (isomorphic to) a subfield of E. 

4. Let o be the coset x + (p(x)) in E. Show that for the evaluation 
homomorphism ¢g : F [x] > E, we have ¢u( f(x) = 0. That is, w is a zero of 
f@)in E. 


An example of a field constructed according to this outline is given in Section 29. 


There, we give addition and multiplication tables for the field Zo[x]/ (x? +x + 1). We 


show there that this field has just four elements, the cosets 


O+?+x41), 1+ 074+x«4+)), xtO?+xt), 


and 


(x +1) + x? +x 41). 


We rename these four cosets 0, 1, a, and a + 1 respectively, and obtain Tables 29.20 
and 29.21 for addition and multiplication in this 4-element field. To see how these tables 
are constructed, remember that we are in a field of characteristic 2, so thata+a= 
a(i+1)=a0=0. Remember also that a isazeroofx? +x +1, sothata* +a+1=0 


and consequently a? =-a—l=a+l. 


2 


@ EXERCISES 27 


Computations 

1. Find all prime ideals and all maximal ideals of Zs. 
. Find all prime ideals and all maximal ideals of Zy2. 
. Find all prime ideals and al} maximal ideals of Zy x Zp. 
. Find all prime ideals and all maximal ideals of Zz x Za. 
. Find all c € Zs such that Zs[x]/(x* + ¢) isa field. 
. Find all c € Zs such that Za[x|/( + x? +c) isa field. 
. Find all c € Z; such that Zalxl/Oe + cx? +1) isa field. 
. Find all c € Zs such that Zs [x] / (x? +x+c) isa field. 
_ Find all c € Zs such that Zs[x]/(x? + ex + 1) isa field. 


Cmrnran bw DN 


Concepts 


In Exercises 10 through 13, correct the definition of the italicized term without reference to the text, if correction 


is needed, so that it is in a form acceptable for publication. 


10. A maximal ideal of a ring R is an ideal that is not contained in any other ideal of R. 


11. A prime ideal of a commutative ring R is an ideal of the form pR = {pr |r € R} for some prime p. 


12. 
13. 


14. 


15. 
16. 
17. 
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A prime field is a field that has no proper subfields. 


A principal ideal of a commutative ring with unity is an ideal N with the property that there exists a € N such 
that N is the smallest ideal that contains a. 


Mark each of the following true or false. 


. Every prime ideal of every commutative ring with unity is a maximal ideal. 


. Every maximal ideal of every commutative ring with unity is a prime ideal. 
. Qis its own prime subfield. 

The prime subfield of C is R. 

. Every field contains a subfield isomorphic to a prime field. 

A ring with zero divisors may contain one of the prime fields as a subring. 


a a Rm ORO Se Pp 


. Every field of characteristic zero contains a subfield isomorphic to Q. 

. Let F be a field. Since F[x] has no divisors of 0, every ideal of F [x] is a prime ideal. 
. Let F be a field. Every ideal of F[x] is a principal ideal. 

. Let F be a field. Every principal ideal of F [x] is a maximal ideal. 


| 


Find a maximal ideal of Z x Z. 
Find a prime ideal of Z x Z that is not maximal. 


Find a nontrivial proper ideal of Z x Z that is not prime. 


18. Is Q[x]/(x? — 5x +6) a field? Why? 

19. Is Q[x]/(x? — 6x + 6) a field? Why? 

Proof Synopsis 

20. Give a one- or two-sentence synopsis of “only if” part of Theorem 27.9. 


21. 
22. 


Give a one- or two-sentence synopsis of “if” part of Theorem 27.9. 


Give a one- or two-sentence synopsis of Theorem 27.24. 


23. Give a one- or two-sentence synopsis of the “only if” part of Theorem 27.25. 
Theory 
24, Let R be a finite commutative ring with unity. Show that every prime ideal in R& is a maximal ideal. 


25, 


26. 


27. 


28. 


29, 


Corollary 27.18 tells us that every ring with unity contains a subring isomorphic to either Z or some Zp. Is it 
possible that a ring with unity may simultaneously contain two subrings isomorphic to Z, and Z,, forn 4m? 
If it is possible, give an example. If it is impossible, prove it. 


Continuing Exercise 25, is it possible that a ring with unity may simultaneously contain two subrings isomorphic 
to the fields Z, and Z, for two different primes p and q? Give an example or prove it is impossible. 


Following the idea of Exercise 26, is it possible for an integral domain to contain two subrings isomorphic to 
Z, and Z, for p # q and p and q both prime? Give reasons or an illustration. 


Prove directly from the definitions of maximal and prime ideals that every maximal ideal of acommutative ring 
R with unity is a prime ideal. [Hint: Suppose M is maximal in R, ab € M, anda ¢ M. Argue that the smallest 
ideal {ra + m|r € R,m € M} containing a and M must contain 1. Express 1 as ra + m and multiply by b.] 


Show that N is a maximal ideal in a ring R if and only if R/WN is a simple ring, that is, it is nontrivial and has 
no proper nontrivial ideals. (Compare with Theorem 15.18.) 
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30. 
31. 
32. 


33. 
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Prove that if F is a field, every proper nontrivial prime ideal of F [x] is maximal. 
Let F bea field and f(x), g(x) € F[x]. Show that f(x) divides g(x) if and only if g(x) € (f(x)). 
Let F be a field and let f(x), g(x) € FLx]. Show that 


N = (rx) f(x) + s@) g(x) |r), s@) € Flx]}} 


is an ideal of F [x]. Show that if f(x) and g(x) have different degrees and N # F |x], then f(x) and g(x) cannot 
both be irreducible over F. 

Use Theorem 27.24 to prove the equivalence of these two theorems: 

Fundamental Theorem of Algebra: Every nonconstant polynomial in C[x] has a zero in C. 

Nullstellensatz for C[x]: Let f\(@), ---, f(x) € C[x] and suppose that every a € C that is a zero of all r of 
these polynomials is also a zero of a polynomial g(x) in C[x]. Then some power of g(x) is in the smallest ideal 
of C[x] that contains the r polynomials f\(x),---, f-(). 


There is a sort of arithmetic of ideals in a ring. The next three exercises define sum, product, and quotient of ideals. 


34. 


35. 


36. 


37. 


38. 


If A and B are ideals of aring R, the sum A + B of A and B is defined by 
A+B={at+blaeA,be Bh. 

a. Show that A + B is an ideal. b. Show that A C A+ Band BCA+B. 

Let A and B be ideals of aring R. The product AB of A and B is defined by 


AB= | Soatia €A,b; € Bane 2'| 
i=1 
a. Show that AB is an ideal in R. b. Show that AB C (AM B). 


Let A and B be ideals of a commutative ring R. The quotient A : B of A by B is defined by 
A:B={reR|rbe€Aforallb € B}. 


Show that A : B is an ideal of R. 
Show that for a field F, the set S of all matrices of the form 


a b 
0 0 
for a,b € F is a right ideal but not a left ideal of 14)(F). That is, show that 5 is a subring closed under 


multiplication on the right by any element of M>(F), but is not closed under eft multiplication. 
Show that the matrix ring M>(Z2) is a simple ring; that is, M2(Z2) has no proper nontrivial ideals. 


+GROBNER BASES FOR IDEALS 


This section gives a brief introduction to algebraic geometry. In particular, we are con- 
cerned with the problem of finding as simple a description as we can of the set of common 
zeros of a finite number of polynomials. In order to accomplish our goal in a single sec- 
tion of this text, we will be stating a few theorems without proof. We recommend the 
book by Adams and Loustaunau [23] for the proofs and further study. 


¥ This section is not used in the remainder of the text. 


28.1 Definition 


28.2 Example 


28.3 Definition 


28.4 Theorem 


Proof 
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Algebraic Varieties and Ideals 


Let F be a field. Recall that F[x,, x2,--++,x,] is the ring of polynomials in n inde- 
terminants x1, <2,---,%, with coefficients in F. We let F” be the Cartesian product 
F x F x--- x F forn factors. For ease in writing, we denote an element (a, a2, +++ , Qn) 
of F” by a, in bold type. Using similar economy, we let F[x] = F[x1, x2, +++, Xp]. For 
each a € F”, we have an evaluation homomorphism ¢,: F[x] > F just as in Theo- 
rem 22.4. That is, for f(x) = f(x, X2,-++, Xn) € F[x], we define g( f(x) = f(a) = 
f(a), a2, +++, a,_). The proof that dg is indeed a homomorphism follows from the asso- 
ciative, commutative, and distributive properties of the operations in F [x] and F’. Just as 
for the one-indeterminate case, an element a of F” is a zero of f(x) € F[x] if f(a) = 0. 
In what follows, we further abbreviate a polynomial f(x) by “f.” 

In this section we discuss the problem of finding common zeros in F” of a finite num- 
ber of polynomials fi, fo,---, f- in F [x]. Finding and studying geometric properties of 
the set of all these common zeros is the subject of algebraic geometry. 


Let S be a finite subset of F [x]. The algebraic variety V(S) in F” is the set of all 
common zeros in F” of the polynomials in S. 1] 


In our illustrative examples, which usually involve at most three indeterminates, we 
use x, y, Zin place of x;, x2, and x3. 


Let S = {2x + y — 2} C R[x, y]. The algebraic variety V(S) in R? is the line with 
x-intercept 1 and y-intercept 2. A 


“We leave to Exercise 29 the straightforward proof that forr elements fi, fo,-:-, f; 
in a commutative ring R with unity, the set 


l={qfitoafht+---+tof-lco ¢R fori =1,---,r} 


is an ideal of R. We denote this ideal by (fi, fo.---, f-). We are interested in the case 
R = F [x] where all the c; and ail the /; are polynomials in F[x]. We regard the c; as 
“coefficient polynomials.” By its construction, this ideal / is the smallest ideal containing 
the polynomials f;, fo,---, f-; it can also be described as the intersection of all ideals 
containing these r polynomials. 


Let J be an ideal in a commutative ring R with unity. A subset {b,, bo,---,b,} of lisa 
basis for J if J = (by, bo, ---,b,). | 

Unlike the situation in linear algebra, there is no requirement of independence for 
elements of a basis, or of unique representation of an ideal member in terms of a basis. 


Let fi, fo,---, fr € F[x]. The set of common zeros in F” of the polynomials f; for 
i=1,2,---,ris the same as the set of common zeros in F” of all the polynomials in 
the entire ideal J = (fi, fo,-++, fr). 


Let 
fHafitoafat- +f, (1) 
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28.5 Theorem 


28.6 Theorem 


Proof 


Ideals and Factor Rings 


be any element of J, and let a € F” be acommon zero of fi, fo,---, and f,. Applying 
the evaluation homomorphism ¢, to Eq. (1), we obtain 


f(a) = era fila) + caa fra) +--+ + cr-@)f,-@) 
= c;(a)0 + c2(a)0 +--+ +4+c,(a)0 = 0, 


showing that a is also a zero of every polynomial f in J. Of course, a zero of every 
polynomial in J will be a zero of each f; because each f; € J. ¢ 


For an ideal J in F'[x], we let V(Z) be the set of all common zeros of all elements 
of 7. We can summarize Theorem 28.4 as 


Vf fa AD = VACA, fas FD). 
We state without proof the Hilbert Basis Theorem. (See Adams and Loustaunau [23].) 


(Hilbert Basis Theorem) Every ideal in F'[x,, x2,---, x,] has a finite basis. 


Our objective: Given a basis for an ideal J in F[x], modify it if possible to 
become a basis that better exhibits the structure of J and the geometry of the 
associated algebraic variety V(J). 


The theorem that follows provides a tool for this task. You should notice that the 
theorem gives information about the division algorithm that we did not mention in 
Theorem 23.1. We use the same notation here as in Theorem 23.1, but with x rather 
than x. If f(x) = g(x)h(x) in F(x), then g(x) and [x] are called “divisors” or “factors” 
of f(x). 


(Property of the Division Algorithm) Let f(x), g(x), g(x) and r(x) be polynomials 
in F[x] such that f(x) = g(x)q(x) + r(x). The common zeros in F” of f(x) and g(x) 
are the same as the common zeros of g(x) and r(x). Also the common divisors in F[x] 
of f(x) and g(x) are the same as the common divisors of g(x) and r(x). 

If f(x) and g(x) are two members of a basis for an ideal J of F [x], then replacement 
of f(x) by r(x) in the basis still yields a basis for J. 


If a € F” is a common zero of g(x) and r(x), then applying ¢, to both sides of the 
equation f(x) = g(x)¢(x) + r(x), we obtain f(a) = g(a)q(a) + r(a) = Og(a) +0 = 0, 
so a is a zero of both f(x) and g(x). If b € F[x] is a common zero of f(x) and g(x), 
then applying @» yields f(b) = g(b)g(b) + r(b) so 0 = Og(b) + r(b) and we see that 
r(b) = 0 as well as g(b). 

The proof concerning common divisors is essentially the same, and is left as Exer- 
cise 30. 

Finally, let B be a basis for an ideal J, let f(x), g(x), € B and let f(x) = g(x)g(x) + 
r(x). Let B’ be the set obtained by replacing f(x) by r(x) in B, and let I’ be the ideal 
having B’ as a basis. Let S be the set obtained from B by adjoining r(x) to B. Note that 
S can also be obtained by adjoining f(x) to B’. The equation f(x) = g(x)g(x) + r(x) 


28.7 Example 


Solution 
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shows that f(x) € J’, so we have B’ C S C J’. Thus S is a basis for 7’. The equation 
r(x) = f(x) — 4(x)g(x) shows that r(x) € I, so we have B C SC I. Thus S is basis 
for I. Therefore J = I' and B’ is a basis for J. 5 


A Familiar Linear Illustration 


A basic technique for problem solving in linear algebra is finding all common solutions 
of a finite number of linear equations. For the moment we abandon our practice of never 
writing “f(x) = 0” for a nonzero polynomial, and work a typical problem as we do in a 
linear algebra course. 


(Solution as in a Linear Algebra Course) Find all solutions in IR? of the linear system 


x+y-3z= 8 
Q2x+y+ z=-5. 


We multiply the first equation by —2 and add it to the second, obtaining the new system 
x+y—3z=8 
-yt7z=—21 
which has the same solution set in R? as the preceding one. For any value z, we can 
find the corresponding y-value from the second equation and then determine x from 


the first equation. Keeping z as parameter, we obtain {(—4z — 13, 7z + 21, z)|z € R} 
as solution set, which is a line in Euclidean 3-space through the point (—13, 21,0). A 


Tn the notation of this section, the problem in the preceding example can be phrased 
as follows: 


Describe V((x + y —3z—8, 2x ty +z2+5)) in R?. 
We solved it by finding a more useful basis, namely 
{x + y —3z—8,-y+72+ 21}. 

Notice that the second member, —y + 7z + 21, of this new basis can be obtained from 
the original two basis polynomials as a remainder r(x, y, z) ina division process, namely 
2 
x+y—3z—-8) 2x+ y+ z+ 5 

2x + 2y — 6z — 16 
—y+7z+21 
Thus 2x +}y+z2+5=(¢+y—3z—8)(2)+ (—y + 7z +21), an expression of the 
form f(x, y,z) = g(x, ¥, Dg, y,z) +r, y, z). We replaced the polynomial f by 
the polynomial r, as in Theorem 28.6, which assures us that V((f, g)) = V((g.7r)) and 
that (f, g) = (g, 7). We chose a very simple, 1-step problem in Example 28.7. However, 
it is clear that the method introduced in a linear algebra course for solving a linear system 
can be phrased in terms of applying a division algorithm process repeatedly to change a 


given ideal basis into one that better illuminates the geometry of the associated algebraic 
variety. 
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A Single Indeterminate Illustration 


Suppose now that we want to find the variety V(/) in R associated with an ideal J in 
F [x], the ring of polynomials in the single indeterminate x. By Theorem 27.24, every 
ideal in F[x] is principal, so there exists f(x) € F[x] such that J = (f(x)). Thus V(J) 
consists of the zeros of a single polynomial, and { /(x)} is probably as simple a basis 
for I as we could desire. We give an example illustrating computation of such a single 
generator f(x) for J in a case where the given basis for 7 contains more than one 
polynomial. Because a polynomial in R{x] has only a finite number of zeros in R, we 
expect two or more randomly selected polynomials in R[x] to have no common zeros, 
but we constructed the basis in our example carefully! 


Let us describe the algebraic variety V in R consisting of common zeros of 
f(x) =xt4x3-3x7-5x—-2 and = g(x) = x° 43x? -—6x-8. 


We want to find a new basis for ( f, g) having polynomials of as small degree as possible, 
so we use the division algorithm f(x) = g(x)g(x) + r(x) in Theorem 23.1, where r(x) 


will have degree at most 2. We then replace the basis { f, g} by the basis {g, r}. 
x—2 
cx = 6S SS ba ay SS 8 
x4 43x93 — 6x7 — 8x 


—2x34+3x774+ 3x- 2 
— 2x3 — 6x2 + 12x + 16 
9x? — 9x — 18 


Because zeros of 9x2 — 9x — 18 are the same as zeros of x? — x — 2, we let r(x) = 
x? — x — 2, and take as new basis 


{g,r} = (2 + 3x? — 6x —8, x? —x -2). 


By dividing g(x) by r(x) to obtain a remainder r(x), we will now be able to find a basis 
{r(x), r1(x)} consisting of polynomials of degree at most 2. 


x+4 


x? —x —2] x? +3x7 -—6x -—8 


x2 — x? 2x 


4x? —4x — 8 
4x? —4x —8 
0 


Our new basis {r(xc), r1(c)} now becomes {x? — x — 2}. Thus J = (f(x), g(x)) = 
(x? —x —2) = (x —2)(x + 1), and we see that V = {—1, 2}. A 


Theorem 28.6 tells us that the common divisors of f(x) and g(x) in the preceding 
example are the same as the common divisors of r(x) and r;(x). Because 0 = (0)r(x), 
we see that r(x) itself divides 0, so the common divisors of f(x) and g(x) are just those 
of r(x), which, of course, include r(x) itself. Thus r(x) is called a “greatest common 
divisor” (abbreviated gcd) of f(x) and g(x). 
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Grébner Bases 


We tackle the problem of finding a nice basis for an ideal J in F[x] = F[x1, X2,°+-. Xa]. 
In view of our illustrations for the linear and single indeterminant cases, it seems reason- 
able to try to replace polynomials in a basis by polynomials of lower degree, or containing 
fewer indeterminates. It is crucial to have a systematic way to accomplish this. Every 
instructor in linear algebra has had an occasional student who refuses to master matrix 
reduction and creates zero entries in columns of a matrix in an almost random fashion, 
rather than finishing the first column and then proceeding to the second, etc. As a first 
step in our goal, we tackle this problem of specifying an order for polynomials in a basis. 


Our polynomials in F[x] have terms of the form ax"'x,"? -- +x," where a € F. 


Properties for an Ordering of Power Products 


1. 1 < P for all power products P ¥ 1. 

2. For any two power products P; and P;, exactly one of 
P; < Pj, P; = Pj, Pj < FP; holds. 

3. If P; < P; and P; < Py, then P; < Py. 

4. If P, < P;, then PP; < PP; for any power product P. 


Let us consider a power product in F[x] to be an expression 


P= x/"'x)"" +--+ x," where all the m; > 0 in Z. 


Notice that all x; are present, perhaps some with exponent 0. Thus in F'[x, y, z}, we must 
write xz” as xy°z? to be a power product. We want to describe a total ordering < on 
the set of all power products so that we know just what it means to say that P, < P; for 
two power products, providing us with a notion of relative size for power products. We 
can then try to change an ideal basis in a systematic way to create one with polynomials 
having terms a; P; with as “small” power products P; as possible. We denote by 1 the 
power product with all exponents 0, and require that an ordering of the power products 
has the properties shown in the box. Suppose that such an ordering has been described 
and that P; # P,; and P; divides P; so that P; = PP; where | < P. From Property 4 
in the box, we then have 1P, < PP, = P;,so P, < P;. Thus P; divides P; implies that 
P;, < P;. In Exercise 28, we ask you to show by a counterexample that P; < P; does 
not imply that P; divides P;. 

It can also be shown that these properties guarantee that any step-by-step process 
for modifying a finite ideal basis that does not increase the size of any maximal power 
product in a basis element and replaces at least one by something smaller at each step 
will terminate in a finite number of steps. 

In F [x] with x the only indeterminate, there is only one power product ordering, for 
by Property 1, we must have 1 < x. Multiplying repeatedly by x and using Property 4, 
we have x < x?, x? < x, etc. Property 3 then shows that 1 < x <x? < x3 <.---is the 
only possible order. Notice that in Example 28.8, we modified a basis by replacing basis 
polynomials by polynomials containing smaller power products. 
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There are a number of possible orderings for power products in F[x] with n inde- 
terminates. We present just one, the lexicographical order (denoted by “lex”). In lex, we 
define 


Si. 82 5, hy fe t 
HP Ny? KM < Ky Ky? Ky” (2) 


if and only if s; < #; for the first subscript i, reading from left to right, such that si is : 
Thus in F[x, y], if we wie power products in the order x” y”, we have y = xy! 
xly® = x andxy < xy*. Using lex, the order of n fdeieaninaiesi is given by 1 < x, < 
Xe <00+ < Xo < x,. Our reduction in Example 28.7, where we first got rid of all “big” 
x°s that we could and then the “smaller” y’s, corresponded to the lex order z < y < x, 
that is, to writing all power products in the x” y"z* order. For the two-indeterminate case 
with y < x, the total lex term order schematically is 

l<y <y<yee<x<xy<xy’ <xy Beg? egy ee yo Sees 

An ordering of power products P induces an obvious ordering of terms aP of a 
polynomial in F[x], which we will refer to as a term order. From now on, given an 
ordering of power products, we consider every polynomial f in F[x] to be written in 
decreasing order of terms, so that the leading (first) term has the highest order. We 
denote by 1t(f) the leading term of f and by 1p(f) the power product of the leading 
term. If f and g are polynomials in F[x] such that lp(g) divides 1p(f), then we can 
execute a division of f by g, as illustrated in the linear and one-indeterminate cases, 
to obtain f(x) = g(x)g(x) + r(x) where Ip(r) < Ip(f). Note that we did not say that 
1p(r) < 1p(g). We illustrate with an example. 


BY division, reduce the basis {xy?, y? — y} for the ideal J = (xy”, y* — y) in R[x, y] to 
one with smaller maximum term size, assuming the order lex with y < x. 


We see that y” divides xy? and compute 


x 
y>—y| xy? 
xy? — xy 


ay 


Because y* does not divide xy, we cannot continue the division. Note that Ip(vy) = xy 
is not less than 1p(y? — y) = y?. However, we do have lp(xy) < 1p(xy”). Our new basis 
for I is {xy, y? — y}. A 


When dealing with more than one indeterminate, it is often easier to perform basis 
reduction by multiplying a basis polynomial g(x) by a polynomial —q(x) and adding it 
to a polynomial f(x) to obtain r(x), as we perform matrix reduction in linear algebra, 
rather than writing out the division display as we did in the preceding example. Starting 
with basis polynomials xy? and y? — y, we can reduce the xy? by multiplying y? — y by 
—x and adding the resulting —xy” + xy to xy’, obtaining the replacement xy for x y?. 
We can do that in our head, and write down the result directly. 

Referring again to Example 28.9, it will follow from what we state later that given 
any polynomial f(x, y) = ci(x, yxy) + e2(x, yO? — y) in (xy, y? — y), either xy or 
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y* will divide 1p(f). (See Exercises 31.) This illustrates the defining property of a 
Grébner basis. 


A set {g1, £2, °°, 8} of nonzero polynomials in F[x1, x2, +--+, x,], with term ordering 
<, is a Grobner basis for the ideal J = (g1, 90,---, g,) if and only if, for each nonzero 
fF <I, there exists some i where 1 <i <r such that 1p(g;) divides Ip(/). E 


While we have illustrated the computation of a Grdbner basis from a given basis 
for an ideal in Examples 28.7, 28.8, and 28.9, we have not given a specific algorithm. 
We refer the reader to Adams and Loustaunau [23]. The method consists of multiplying 
some polynomial in the basis by any polynomial in F[x] and adding the result to another 
polynomial in the basis in a manner that reduces the size of power products. In our 
illustrations, we have treated the case involving division of f(x) by g(x) where Ip(g) 
divides 1p(f), but we can also use the process if 1p(g) only divides some other power 
product in f. For example, if two elements in a basis are xy — y? and y* — 1, we can 
multiply y? — 1 by y and add it to xy — y’, reducing xy — y’ to xy — y. Theorem 28.6 
shows that this is a valid computation. 


You may wonder how any basis {g1, g2,---, g-} can fail to be a Grobner basis for 
I = (£1, 82,+°+, 8) because, when we form an element c1g1 + cog. +++: +c,g, in I, 
we see that Ip(g;) is a divisor of 1p(c;g;) fori = 1, 2, ---, 7. However, cancellation of 


power products can occur in the addition. We illustrate with an example. 


Consider the ideal J = (x?y — 2, xy? — y) in REx, y}. The polynomials in the basis 
shown cannot be reduced further. However, the ideal J contains y(x?y — 2) — x(xy* — 
y) = xy — 2y, whose leading power product xy is not divisible by either of the leading 
power products x*y or xy? of the given basis. Thus {x?y — 2, xy? — y} is not a Grébner 
basis for J, according to Definition 28.10. A 


When we run into a situation like that in Example 28.11, we realize that a Grébner 
basis must contain some polynomial with a smaller leading power product than those 
in the given basis. Let f and g be polynomials in the given basis. Just as we did in 
Example 28.11, we can multiply f and g by as small power products as possible so that 
the resulting two leading power products will be the same, the least common multiple 
(Icm) of 1lp(f) and 1p(g), and then subtract or add with suitable coefficients from F so 
cancellation results. We denote a polynomial formed in this fashion by S(f, ). We state 
without proof a theorem that can be used to test whether a basis is a Grdbner basis. 


A basis G = {g1, 82,--+, gr} is a Grobner basis for the ideal (g1, g2,---, g-) if and only 
if, for alli A j, the polynomial S(g;, g;) can be reduced to zero by repeatedly dividing 
remainders by elements of G, as in the division algorithm. 


As we mentioned before, we may prefer to think of reducing S(g;, 9) by a sequence 
of operations consisting of adding (or subtracting) multiples of polynomials in G, rather 
than writing out division. 

We can now indicate how we can obtain a Grébner basis from a given basis. First, 
reduce the polynomials in the basis as far as possible among themselves. Then choose 
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polynomials g; and g; in the basis, and form the polynomial S(g;, g;). See if S(g;, g;) 
can be reduced to zero as just described. If so, choose a different pair of polynomials, 
and repeat the procedure with them. If S(g;, g;) cannot be reduced to zero as described 
above, augment the given basis with this S(g;, g;), and start all over, reducing this basis 
as much as possible. By Theorem 28.12, when every polynomial S(g;, ¢;) for alli A j 
can be reduced to zero using polynomials from the latest basis, we have arrived at a 
Grdbner basis. We conclude with a continuation of Example 28.11. 


Continuing Example 2.8.11, let g) = x*y — 2, g2 = xy? — y, and J = (g1, go) in R?. 
In Example 28.11, we obtained the polynomial S(g1, g2) = xy — 2y, which cannot be 
reduced to zero using g; and g). We now reduce the basis {x?y — 2, xy? — y, xy —2y}, 
indicating each step. 


{x?y —2, xy? — y, xy —2y} augmented basis 

{Qxy — 2, xy? — y, xy —2y} by adding (—x) (third) to first 
{Qxy — 2, 2y* — y, xy —2y} by adding (—y) (third) to second 
{4y — 2, 2y? — y, xy — 2y} by adding (—2) (third) to first 


{4y — 2,0, xy — 2y} by adding (— 3) (first) to second 
{4y — 2, 0, bx — 2y} by adding (— 4) (first) to third 
{4y — 2, 0, 5x — 1} by adding G) (first) to third 


Clearly, {y — 4, x — 2} is a Grobner basis. Note that if f = y — and g = x — 2, then 
S(f, g) =xf — yg = (xy — $) — Gy — 2y) = —} + 2y, which can readily be reduced 


_to zero by adding }(x — 2) and —2(y — $). 


From the Grdbner basis, we see that the algebraic variety V(/) contains only one 
point, (2, 4), in R?. A 


The importance of Grébner bases in applications is due to the fact that they are 
machine computable. They have applications to engineering and computer science as 
well as to mathematics. 


@ EXERCISES 28 


In Exercises 1 through 4, write the polynomials in R[x, y, z] in decreasing term order, using the order lex for power 
products x” y"z* where z< y < x. 


: 
i 


1, 2xy3Z> — 5x? yz3 + 7x2 y2z — 3x3 2. 3y?2° — 4x + Sy3z3 — 82? 
3. 3y — 7x + 102? — 2xy?22? + 2x2 y2? 4. 38 — 4xz+2yz — &xy + 3yz3 


In Exercises 5 through 8, write the polynomials in R[x, y, z] in decreasing term order, using the order lex for power 
products 7” y"x* where x < y < z. 


5. The polynomial in Exercise 1. 6. The polynomial in Exercise 2. 


7. The polynomial in Exercise 3. 8. The polynomial in Exercise 4. 


Another ordering, deglex, for power products in F[x] is defined as follows: 


Si, 52 Sn Pings EB. contala 
Xp Ky? Hy" < Ky Xy Xn 
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if and only if either }77_, 5; < }-7_, #;, or these two sums are equal and s; < ft; for the smallest value of i such that 
5; & t;. Exercises 9 through 13 are concerned with the order deglex. 


9, List, in increasing order, the smallest 20 power products in R[x, y, z] for the order deglex with power products 
x™y"z’ where z << y <x. 
In Exercises 10 through 13, write the polynomials in order of decreasing terms using the order deglex with power 
products x” y"z*> where z < y < x. 
10. The polynomial in Exercise 1. 11. The polynomial in Exercise 2. 
12. The polynomial in Exercise 3. 13. The polynomial in Exercise 4. 


For Exercises 14 through 17, let power products in R[x, y, z] have order lex where z < y < x. If possible, perform 
a single-step division algorithm reduction that changes the given ideal basis to one having smaller maximum term 
order. 


14, (xy? — 2x, x?y + 4xy, xy — y?) 15. (xy ty?, > +z,x-y%) 
16. (xyz — 327, x? + y2z3, x2yz3 + 4) 17. (y?z3 +3, yz? — 2z, y?z? +3) 


In Exercises 18 and 19, let the order of power products in R[w, x, y, z] be lex withz < y < x < w. Find a Grébner 
basis for the given ideal. 


18. wtx—y+42—-3,2w+x+y—224+4,w+3x-3y4+z—-5) 

19. Ww —4x4+3y —z2+2,2w —2x+y—2z4+5,w —10x+8y—z- 1) 
In Exercises 20 through 22, find a Grdbner basis for the indicated ideal in R[x]. 
20. (x4 +.x3 — 3x? — 4x —4, x9 4 x? — 4x — 4) 

21. (x4 — 4x03 + 5x? — 2x, 03 — x? — 4x 4-4, x7 — 3x 4-2) 

22. (x +x? 42x —5, x9 —x? +x —-1) 


In Exercises 23 through 26, find a Grébner basis for the given ideal in R[x, y]. Consider the order of power products 
to be lex with y < x. If you can, describe the corresponding algebraic variety in R[x, y]. 


23. (x?y —x —2,xy+2y—9) 24, (x2y +x, xy* — y) 
25. (x?y+xt+1, xy? +y—-1) 26. (x?y +xy?, xy — x) 
Concepts 


27. Let F bea field. Mark each of the following true or false. 


a. Every ideal in F[x] has a finite basis. 
b. Every subset of R? is an algebraic variety. 


ce. The empty subset of R? is an algebraic variety. 


d. Every finite subsct of R? is an algebraic variety. 


e. Every line in R? is an algebraic variety. 
f. Every finite collection of lines in R? is an algebraic variety. 


g. A greatest common divisor of a finite number of polynomials in R[x] (one indeterminate) can be 
computed using the division algorithm repeatedly. 

h. I have computed Grébner bases before I knew what they were. 

i. Any ideal in F[x] has a unique Groébner basis. 

j. The ideals (x, y) and (x, y?) are equal because they both yield the same algebraic variety, namely 
{(0, 0)}, in R?. 


28. Let R[x, y] be ordered by lex. Give an example to show that P; < P, does not imply that P; divides P;. 
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29. Show that if fi, fo,---. f, are elements of a commutative ring R with unity, then J = {ei ft torfete:- + 
cf, |e; € I fori =1,---,r} is anideal of R. 


30. Show that if f(x) = g(x)g(x) + r(x) in F[Xx], then the common divisors in F[x] of f(x) and g(x) are the same 
as the common divisors in F'[x] of g(x) and r(x). 


31. Show that {xy, y? — y) is a Grobner basis for (xy, y? — y}, as asserted after Example 28.9. 
32. Let F be a field. Show that if S is a nonempty subset of F”, then 
I(S) = {f() € F[x]| f(s) = 0 for alls € S} 

is an ideal of F'[x]. 
33. Referring to Exercise 32, show that S € V(J(S)). 
34, Referring to Exercise 32, give an example of a subset 5 of R? such that VU(S)) # S. 
35. Referring to Exercise 32, show that if N is an ideal of F [x], then N C [(V(N)). 
36. Referring to Exercise 32, give an example of an ideal N in R[x, y] such that 1(V(N)) 4 N. 


29.1 Definition 


Extension Fields 


Section 29 = Introduction to Extension Fields 
Section 30 Vector Spaces 

Section 31 Algebraic Extensions 

Section 32 ‘Geometric Constructions 
Section 33 _ Finite Fields 


INTRODUCTION TO EXTENSION FIELDS 


Our Basic Goal Achieved 


Weare now in a position to achieve our basic goal, which, loosely stated, is to show that 
every nonconstant polynomial has a zero. This will be stated more precisely and proved 
in Theorem 29.3. We first introduce some new terminology for some old ideas. 


A field E is an extension field of a field F if F < E. | 
Cc Fs, y) 
R F \ ea Q) 
Q F 
29.2 Figure 


Thus R is an extension field of Q, and C is an extension field of both = and 
Q. As in the study of groups, it will often be convenient to use subfield diagrams to 
picture extension fields, the larger field being on top. We illustrate this in Fig. 29.2. A 
configuration where there is just one single column of fields, as at the left-hand side of 
Fig. 29.2, is often referred to, without any precise definition, as a tower of fields. 


T Section 32 is not required for the remainder of the text. 


265 


266 


Now for our basic goal! This great and important result follows quickly and elegantly 


(Kronecker’s Theorem) (Basic Goal) Let F bea field and let f(x) be a nonconstant 
polynomial in F[x]. Then there exists an extension field E of F and ana € E such that 


Part VI Extension Fields 
from the techniques we now have at our disposal. 
29.3 Theorem 
f(a) = 0. 
Proof 


By Theorem 23.20, f(x) has a factorization in F [x] into polynomials that are irreducible 
over F. Let p(x) be an irreducible polynomial in such a factorization. It is clearly 
sufficient to find an extension field E of F containing an element w such that p(a) = 0. 

By Theorem 27.25, (p(x)) is a maximal ideal in [x], so F[x]/(p(x)) isa field. We 
claim that F can be identified with a subfield of F[x]/(p(x)) in a natural way by use of 
the map y : F > F[x]/(p()) given by 

(a) = a + (p(x)) 

fora € F. This map is one to one, forif w(a) = yr(b), that is, ifa + (p(x)) = b + (p(x)} 
for some a, b € F, then (a — b) € (p(x)), soa — b must be a multiple of the polynomial 
p(x), which is of degree >1. Now a, b € F implies that a — b is in F. Thus we must 
have a — b =0, soa =b. We defined addition and multiplication in F[x]/(p(x)) by 
choosing any representatives, so we may choose a € (a + (p(x))). Thus yw is a homo- 
morphism that maps F’ one-to-one onto a subfield of F[x]/(p(x)). We identify F with 
{a + (p(x)) |a € F} by means of this map w. Thus we shall view E = F[x]/(p@)) as 
an extension field of F. We have now manufactured our desired extension field E of F’. 


It remains for us to show that Z contains a zero of p(x). 


* Let us set 


so a € E. Consider the evaluation homomorphism ¢z : F[x] > E, given by Theo- 


a =x + (p(x), 


rem 22.4. If p(x) = a9 tax tee + a,x", where a; € F, then we have 
aC p(X) = do + ar(x + (p(x))) +++ + ane + (p@))Y" 


# Historica Note 


Le Kronecker is known for his insistence 
on constructibility of mathematical objects. As 
he noted, “God made the integers; all else is the 
work of man.” Thus, he wanted to be able to con- 
struct new “domains of rationality” (fields) by using 
only the existence of integers and indeterminates. 
He did not believe in starting with the real or com- 
plex numbers, because as far as he was concerned, 
those fields could not be determined in a construc- 
tive way. Hence in an 1881 paper, Kronecker cre- 
ated an extension field by simply adjoining to a 
given field a root @ of an irreducible nth degree 
polynomial p(x); that is, his new field consisted of 


expressions rational in the original field elements 
and his new root « with the condition that p(a) = 0. 
The proof of the theorem presented in the text 
(Theorem 29.3) dates from the twentieth century. 
Kronecker completed his dissertation in 1845 
at the University of Berlin. For many years there- 
after, he managed the family business, ultimately 
becoming financially independent. He then returned 
to Berlin, where he was elected to the Academy of 
Sciences and thus permitted to lecture at the univer- 
sity. On the retirement of Kummer, he became a pro- 
fessor at Berlin, and with Karl Weierstrass (1815- 
1897) directed the influential mathematics seminar. 


29.4 Example 


29.5 Example 


29.6 Definition 
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in E = F[x]/(p(x)). But we can compute in F[x]/(p(x)) by choosing representatives, 
and x is a representative of the coset a = x + (p(x)). Therefore, 


plot) = (ao + ax ++ +++ ayx") + (p(x) 
= p(x) + (p()) = (p(x) = 0 


in F[x]/(p(x)). We have found an element a in E = F[x]/(p(x)) such that p(a) = 0, 
and therefore f(a) = 0. Sd 


We illustrate the construction involved in the proof of Theorem 29.3 by two exam- 
ples. 


Let F = R, and let f(x) = x? + 1, which is well known to have no zeros in R and thus 
is irreducible over R by Theorem 23.10. Then (x? + 1) is a maximal ideal in R[+], so 
R[x]/ (x? + 1) is a field. Identifying r € R with r + (x? + 1) in R[x]/(x? + 1), we can 
view R as a subfield of E = R[x]/(x? + 1). Let 


a=xt+(x?4+1). 
Computing in R[x]/(x? + 1), we find 


e+laiet le? t+ nr+d+’ +1) 
= (x7 4+ 1)4+ (x7 +1) =0. 


Thus @ is a zero of x? + 1. We shall identify R[x]/(x? + 1) with C at the close of 
this section. A 


Let F = Q, and consider f(x) = x*+ —5x?46. This time f(x) factors in Q[x] into 
(x? — 2)(x? — 3), both factors being irreducible over Q, as we have seen. We can start 
with x? — 2 and construct an extension field E of Q containing a such that w? — 2 = 0, or 
we can construct an extension field K of Q containing an element B such that B* — 3 = 0. 
The construction in either case is just as in Example 29.4. A 


Algebraic and Transcendental Elements 


As we said before, most of the rest of this text is devoted to the study of zeros of 
polynomials. We commence this study by putting an element of an extension field E of 
a field F into one of two categories. 


An element a of an extension field E of a field F is algebraic over F if f(~) =0 
for some nonzero f(x) € F[x]. If @ is not algebraic over F,, then w is transcendental 
over F.  ] 
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Proof 


Extension Fields 


C is an extension field of Q. Since ./2 is azero of x? — 2, we see that /2 is an algebraic 
element over Q. Also, i is an algebraic element over Q, being a zero of xetd. A 


It is well known (but not easy to prove) that the real numbers 7 and e are transcendental 
over Q. Here ¢ is the base for the natural logarithms. A 


Just as we do not speak simply of an irreducible polynomial, but rather of an irre- 
ducible polynomial over F, similarly we don’t speak simply of an algebraic element, 
but rather of an element algebraic over F. The following illustration shows the reason 
for this. 


The real number z is transcendental over Q, as we stated in Example 29.8. However, 
x is algebraic over R, for it is a zero of (x — 7) € R[x]. A 


It is easy to see that the real number V1 + V3 is algebraic over Q. Forifa = V1 + 73, 
then a? = 1+ /3, so a? — 1 = /3 and (a? — 1)? = 3. Therefore a* — 2a? -2 =0, 
so a is a zero of x* — 2x? — 2, which is in Q[x]. A 


To connect these ideas with those of number theory, we give the following definition. 


An element of C that is algebraic over Q is an algebraic number. A transcendental 
number is an element of C that is transcendental over Q. | 


There is an extensive and elegant theory of algebraic numbers. (See the Bibliogra- 
phy.) 

The next theorem gives a useful characterization of algebraic and transcendental 
elements over F in an extension field £ of F. It also illustrates the importance of our 
evaluation homomorphisms @,. Note that once more we are describing our concepts in 
terms of mappings. 


Let E be an extension field of a field F and let a € E. Let d, : F[x] — E be the 
evaluation homomorphism of F'[x] into E such that ¢,(a) = a fora € F andg@,(x) = a. 
Then @ is transcendental over F if and only if @, gives an isomorphism of F[x] with a 
subdomain of £, that is, if and only if @, is a one-to-one map. 


The element a is transcendental over F if and only if f(@) 4 0 for all nonzero f(x) € 
F[x], which is true (by definition) if and only if d.(f(x)) ¥ 0 for all nonzero f(x) € 
F[x], which is true if and only if the kernel of @, is {0}, that is, if and only if @, is a 
one-to-one map. ¢ 


The Irreducible Polynomial for a over F 


Consider the extension field R of Q. We know that /2 is algebraic over Q, being a zero of 
x? — 2. Of course, /2 is also a zero of x3 — 2x and of.x* — 3x2 +2 = (x? — 2)(x? — 1). 
Both these other polynomials having ./2 as a zero were multiples of x? — 2. The next 
theorem shows that this is an illustration of a general situation. This theorem plays a 
central role in our later work. 


29.13 Theorem 


Proof 


29.14 Definition 


29.15 Example 
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Let & be an extension field of F’, andleta € E, where wis algebraic over F. Then there is 
an irreducible polynomial p(x) € F[x] such that p(@) = 0. This irreducible polynomial 
p(x) is uniquely determined up to a constant factor in F and is a polynomial of minimal 
degree >1 in F[x] having a as a zero. If f(~) = 0 for f(x) € F[x], with f(x) = 2. 
then p(x) divides f(x). 


Let @, be the evaluation homomorphism of F [x] into £, given by Theorem 22.4. The 
kernel of ¢, is an ideal and by Theorem 27.24 it must be a principal ideal generated by 
some p(x) € F[x]. Now (p(x)} consists precisely of those elements of F[x] having a 
as a zero. Thus, if f(@) = 0 for f(x) #0, then f(x) € (p(x)), so p(x) divides f(x). 
Thus p(x) is a polynomial of minimal degree > 1 having @ as a zero, and any other such 
polynomial of the same degree as p(x) must be of the form (a) p(x) for some a € F. 

It only remains for us to show that p(x) is irreducible. If p(x) = r(x)s(x) were a 
factorization of p(x) into polynomials of lower degree, then p(a@) = 0 would imply that 
r(a)s(c) = 0, so either r(~) = 0 or s(a) = 0, since E is a field. This would contradict 
the fact that p(x) is of minimal degree >1 such that p(a@) = 0. Thus p(x) is irredu- 
cible. 


By multiplying by a suitable constant in F, we can assume that the coefficient of the 
highest power of x appearing in p(x) of Theorem 29.13 is 1. Such a polynomial having 1 
as the coefficient of the highest power of x appearing is a monic polynomial. 


Let E be an extension field of a field F’, and let w € E be algebraic over F’. The unique 
monic polynomial p(x) having the property described in Theorem 29.13 is the irre- 
ducible polynomial for a over F and will be denoted by irr(a@, F). The degree of 
irr(a, F’) is the degree of « over F, denoted by deg(a, F). | 


We know that irr(./2, Q) = x? — 2. Referring to Example 29.10, we see that for a = 
V1+ +3 in R, a is a zero of x* — 2x? — 2, which is in Q[x]. Since x+ — 2x? — 2 is 
irreducible over Q (by Eisenstein with p = 2, or by application of the technique of 
Example 23.14), we see that 


int(V1 + V3, Q = xt — 2x? - 2. 
Thus / 1+ V3 is algebraic of degree 4 over Q. A 


Just as we must speak of an element @ as algebraic over F rather than simply as 
algebraic, we must speak of the degree of a over F rather than the degree of a. To take 
a trivial illustration, /2 € R is algebraic of degree 2 over Q but algebraic of degree 1 
over R, for inr(V2, R)=x- J/2. 

The quick development of the theory here is due to the machinery of homomorphisms 
and ideal theory that we now have at our disposal. Note especially our constant use of 
the evaluation homomorphisms @¢,. 


Simple Extensions 


Let E be an extension field of a field F, and let aw € E. Let ¢, be the evaluation homo- 
morphism of F [x] into E with ¢,.(a) = a fora € F and é,(x) = a, as in Theorem 22.4. 
We consider two cases. 
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29.16 Example 


29.17 Definition 


29.18 Theorem 


Proof 
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CaseI Suppose « is algebraic over F. Then as in Theorem 29.13, the kernel of 
oy is (irr(a, F)) and by Theorem 27.25, (irr(a, F)) is a maximal ideal 
of F[x]. Therefore, F[x]/tirr(a@, F)) is a field and is isomorphic to the 
image ¢y[F [x]] in E. This subfield ¢,[F[x]] of E is then the smallest 
subfield of E containing F and a. We shall denote this field by F(a). 


Case Il Suppose «a is transcendental over F. Then by Theorem 29.12, ¢, gives 
an isomorphism of F'[x] with a subdomain of £. Thus in this case 
éy[F [x] is nota field but an integral domain that we shall denote by 
F[a]. By Corollary 21.8, E contains a field of quotients of F[a], which 
is thus the smallest subfield of Z containing F and a. As in Case I, we 
denote this field by F(a). 


Since w is transcendental over Q, the field Q(z) is isomorphic to the field Q(x) of 
rational functions over Q in the indeterminate x. Thus from a structural viewpoint, an 
element that is transcendental over a field F behaves as though it were an indeterminate 
over F. A 


An extension field E of a field F is a simple extension of F if E = F(a) for some 
aék. a 


Many important results appear throughout this section. We have now developed so 
much machinery that results are starting to pour out of our efficient plant at an alarming 
rate. The next theorem gives us insight into the nature of the field F(a) in the case where 


-o is algebraic over F’. 


Let E be a simple extension F(a) of a field F, and let a be algebraic over F. Let 
the degree of irr(a, F) be n > 1. Then every element § of E = F(a) can be uniquely 
expressed in the form 


B=botbiat--- +b, 10", 
where the b; are in F. 
For the usual evaluation homomorphism ¢,, every element of 
F(a) = dof FIX] 
is of the form ¢.(f(x)) = f(a), a formal polynomial in with coefficients in F’. Let 
in(a, F) = p(x) =x" + big xO ee 
Then p(a) = 0, so 
a” = —a,_ja"-!—.-.—ap. 


This equation in F(a) can be used to express every monomial w” for m > n in terms of 
powers of @ that are less than n. For example, 


29.19 Example 


be 
~ 
—_ 
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a) = ear” = —ay_ya" — ay_p0"~! — --- — aga 


= —An—1(—ay—10"~| — ++» — a9) — Gy_20""| — ++ — ao. 
Thus, if B € F(a), 6 can be expressed in the required form 
B=botbiat +b 10". 
For uniqueness, if 
bo tba +--+ + bye" = bh + bia t---+d)_,a" 
for bj € F, then 
(bo — bg) + (bi — By) ++ + Gna 1 = bye"! = BCX) 


isin F[x] and g(a) = 0. Also, the degree of g(x) is less than the degree of irr(a, F). 
Since irr(@, F) is a nonzero polynomial of minimal degree in F[x] having @ as a zero, 
we must have g(x) = 0. Therefore, b; — b; = 0, so 


b; = by, 
and the uniqueness of the ; is established. ¢ 


We give an impressive example illustrating Theorem 29.18. 


The polynomial p(x) = x? +x +1 in Z,[x] is irreducible over Zy by Theorem 23.10, 
since neither element 0 nor element 1 of Zo is a zero of p(x). By Theorem 29.3, we 
know that there is an extension ficld E of Zz containing a zero a of x? +x +1. By 
Theorem 29.18, Z2(a) has as elements 0 + Ow, 1 + O0a,0-+ la, and 1 + 1a, that is, 0, 
1, a, and 1 +a. This gives us a new finite field, of four elements! The addition and 
multiplication tables for this field are shown in Tables 29.20 and 29.21. For example, to 
compute (1 + a)(1 + a) in Z2(a), we observe that since p(a) = a? +a +1 = 0, then 
oe =-a-l=a+1. 

Therefore, 

(Q+ea)\1l+ea)=Ilteta+oe’?=1l+e7?=l1+atl=a. A 


Finally, we can use Theorem 29.18 to fulfill our promise of Example 29.4 and 
show that R[x]/(x? + 1) is isomorphic to the field C of complex numbers. We saw in 
Example 29.4 that we can view R[x]/(x? + 1) as an extension field of R. Let 


a=x4+ 741). 


29.20 Table 29.21 Table 
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Then R(a) = R[x]/(x? + 1) and consists of all elements of theforma + ba for a,b ER, 
by Theorem 29.18. But since a2 +1=0, we see that a plays the role of i ¢ C, and 
a+ ba plays the role of (a + bi) € C. Thus R(a) ~ C. This is the elegant algebraic 
way to construct C from R. 
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Computations 

In Exercises 1 through 5, show that the given number @ € C is algebraic over Q by finding f(x) € Q[x] such that 
| fla) = 0. 
| eee) 2. /24+ V3 3. 1+i 


| 4. Jl +42 a ee 


In Exercises 6 through 8, find irr(a, Q) and deg(q, Q) for the given algebraic number a € C. Be prepared to prove 
that your polynomials are irreducible over Q if challenged to do so. 


6. V3-— V6 7 JQ+V7 8. J2+i 


In Exercises 9 through 16, classify the given a € C as algebraic or transcendental over the given field F. If w is 
algebraic over F, find deg(a, F). 


9a =i,F =Q 10.¢=1+i,F =R 
ll. a= /z,F =Q 12.0=./%7,F =R 
13. « = fa, F = Qi) 14.0 =77,F =Q 
15. @=77,F = Q(z) 16. a = 27, F = Qt’) 


17. Refer to Example 29.19 of the text. The polynomial x2 +x + 1has a zero a in Z2(@) and thus must factor into 
a product of linear factors in (Z2(a))[*]. Find this factorization. [Hint: Divide x? +x+1 by x —a by long 
division, using the fact that oe =at1.] 

18. a. Show that the polynomial x? + | is irreducible in Zs[*]. 

b. Let a be a zero of x? + 1 in an extension field of Z;. As in Example 29.19, give the multiplication and 
addition tables for the nine elements of Z3(a), written in the order 0, 1, 2, a, 2a, 1 +a, 1+2a,2-+ a, and 
2+ 2a. 


Concepts 

In Exercises 19 through 22, correct the definition of the italicized term without reference to the text, if correction 

is needed, so that it is in a form acceptable for publication. 

19. An element a of an extension field E of a field F if algebraic over F if and only if is a zero of some 
polynomial. 

20. An element of an extension field E ofa field F is transcendental over F if and only if f is not a zero of any 
polynomial in F[x]. 

21, A monic polynomial in F[x]is one having all coefficients equal to 1. 


22. A field E is a simple extension of a subfield F if and only if there exists some a € E such that no proper 
subfield of £ contains a. 


23. 


24, 


25. 


26. 


27. 
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Mark each of the following true or false. 


a. The number z is transcendental over Q. 
b. C is a simple extension of R. 


¢e. Every element of a field F is algebraic over F. 


—___. d. Ris an extension field of Q. 

e. Q is an extension field of Zp. 

f. Let a € C be algebraic over Q of degree n. If f(@~)=0 for nonzero f(x) € Q[+], then 
(degree f(x)) >n. 

g. Let a EC be algebraic over Q of degree n. If f(a) =0 for nonzero f(x) € R[x], then 
(degree f(x)) > n. 

——__— h. Every nonconstant polynomial in F'[x] has a zero in some extension ficld of F. 


i. Every nonconstant polynomial in F [x] has a zero in every extension field of F. 
j. If x is an indeterminate, Q[7] ~ Q[x]. 


We have stated without proof that 7 and ¢ are transcendental over Q. 


a. Find a subfield F of R such that mz is algebraic of degree 3 over F. 
b. Find a subfield E of R such that e? is algebraic of degree 5 over E. 


a. Show that x? + x? + 1 is irreducible over Zp. 


b. Let w be a zero of x3 + x? + 1 in an extension field of Z.. Show that x? + x? + 1 factors into three linear 
factors in (Z2(a))[x] by actually finding this factorization. [Hint: Every element of Z2(a) is of the form 


. ay tae tana? for a; =0,1. 
Divide x? + x* + 1 by x — a by long division. Show that the quotient also has a zero in Z(a) by simply 
trying the eight possible elements. Then complete the factorization. | 
Let E be an extension field of Z, and let aw € E be algebraic of degree 3 over Z,. Classify the groups (Z2(a), +) 
and ((Z2(a))*, -}) according to the Fundamental Theorem of finitely generated abelian groups. As usual, (Zo(@))* 
is the set of nonzero elements of Z2(a). 


Let E be an extension field ofafield F andleta <¢ E be algebraic over F'. The polynomial irr(a, F’) is sometimes 
referred to as the minimal polynomial for w over F’. Why is this designation appropriate? 


Proof Synopsis 


28. 


Give a two- or three-sentence synopsis of Theorem 29.3. 


Theory 


29. 


30. 


31. 


Let E be an extension field of F, and leta, 8 € E. Suppose & is transcendental over F but algebraic over F'(B). 
Show that f is algebraic over F(q@). 


Let E be an extension field of a finite field F, where F has g elements. Leta € E be algebraic over F of degree 
n. Prove that F(a) has g” elements. 


a. Show that there exists an irreducible polynomial of degree 3 in Zs[x]. 
b. Show from part (a) that there exists a finite field of 27 elements. [Hint: Use Exercise 30.] 
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32. Consider the prime field Z, of characteristic p # 0. 


33. 


34. 


35 


36. 


37. 


a. Show that, for p # 2, not every element in Z, is a square of an element of Z,. [Hint: V=(p-1P=1 
in Zp. Deduce the desired conclusion by counting.| 


b. Using part (a), show that there exist finite fields of p* elements for every prime p in Z*. 
YP. P 


Let E be an extension field of a field F and let a € E be transcendental over F. Show that every element 
of F(a) that is not in F is also transcendental over F. 


Show that {a + b(x/2) + e(/2)? la, b,c € Q} is a subfield of R by using the ideas of this section, rather than 
by a formal verification of the field axioms. [Hint: Use Theorem 29.18.] 


Following the idea of Exercise 31, show that there exists a field of 8 elements; of 16 elements; of 25 elements. 


Let F be a finite field of characteristic p. Show that every element of F is algebraic over the prime field Zp < F. 
[Hint: Let F* be the set of nonzero elements of F. Apply group theory to the group (F™, -) to show that every 
a € F* is a zero of some polynomial in Z,[x] of the form x” — 1.] 


Use Exercises 30 and 36 to show that every finite field is of prime-power order, that is, it has a prime-power 
number of elements. 


VECTOR SPACES 


The notions of a vector space, scalars, independent vectors, and bases may be familiar. 
In this section, we present these ideas where the scalars may be elements of any field. 
We use Greek letters like w and f for vectors since, in our application, the vectors will 
be elements of an extension field E of a field F. The proofs are all identical with those 
often given in a first course in linear algebra. If these ideas are familiar, we suggest 
studying Examples 30.4, 30.8, 30.11, 30.14, and 30.22, and then reading Theorem 30.23 
and its proof. If the examples and the theorem are understood, then do some exercises 
and proceed to the next section. 


Definition and Elementary Properties 


The topic of vector spaces is the cornerstone of linear algebra. Since linear algebra is not 
the subject for study in this text, our treatment of vector spaces will be brief, designed 
to develop only the concepts of linear independence and dimension that we need for our 
field theory. 

The terms vector and scalar are probably familiar from calculus. Here we allow 
scalars to be elements of any field, not just the real numbers, and develop the theory by 
axioms just as for the other algebraic structures we have studied. 


30.1 Definition Let F bea field. A vector space over F (or F-vector space) consists of an abelian group 
V under addition together with an operation of scalar multiplication of each element of 
V by each element of F on the left, such that for alla, b € F and a, 8 € V the following 
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conditions are satisfied: 


F. ane V. 

B,. alba) = (ab)a. 

Zs. (a + b)a = (aa) + (ba). 
Yj, ala + B) = (aa) + (af). 
Be. la=a, 


The elements of V are vectors and the elements of F are scalars. When only one field 
F is under discussion, we drop the reference to F and refer to a vector space. a 


Note that scalar multiplication for a vector space is not a binary operation on one 
set in the sense we defined it in Section 2. It associates an element aa of V with each 
ordered pair (a, w), consisting of an element a of F and an element a of V. Thus scalar 
multiplication is a function mapping F x V into V. Both the additive identity for V, the 
0-vector, and the additive identity for F’, the 0-scalar, will be denoted by 0. 

30.2 Example Consider the abelian group (R,, ++) = R x Rx.--- x Rforn factors, which consists of 
ordered n-tuples under addition by components. Define scalar multiplication for scalars 


in R by 


ra — (ray,-++,6ap) 


Historica Note 


Ihe ideas behind the abstract notion of a vector 

space occurred in many concrete examples dur- 
ing the nineteenth century and earlier. For example, 
William Rowan Hamilton dealt with complex num- 
bers explicitly as pairs of real numbers and, as noted 
in Section 24, also dealt with triples and eventually 
quadruples of real numbers in his invention of the 
quaternions. In these cases, the “vectors” turned out 
to be objects which could both be added and multi- 
plied by scalars, using “reasonable” rules for both 
of these operations. Other examples of such objects 
included differential forms (expressions under in- 
tegral signs) and algebraic integers. 

Although Hermann Grassmann (1809-1877) 
succeeded in working out a detailed theory of n- 
dimensional spaces in his Die Lineale Ausdehnung- 
slehre of 1844 and 1862, the first mathematician 
to give an abstract definition of a vector space 


equivalent to Definition 30.1 was Giuseppe Peano 
(1858-1932) in his Calcolo Geometrico of 1888. 
Peano’s aim in the book, as the title indicates, 
was to develop a geometric calculus. According to 
Peano, such a calculus “consists of a system of op- 
erations analogous to those of algebraic calculus, 
but in which the objects with which the calcula- 
tions are performed are, instead of numbers, geo- 
metrical objects.” Curiously, Peano’s work had no 
immediate effect on the mathematical scene. Al- 
though Hermann Weyl (1885-1955) essentially re- 
peated Peano’s definition in his Space-Time-Matter 
of 1918, the definition of a vector space did not 
enter the mathematical mainstream until it was an- 
nounced for a third time by Stefan Banach (1892- 
1945) in the 1922 publication of his dissertation 
dealing with what we now call Banach spaces, com- 
plete normed vector spaces. 
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30.4 Example 


30.5 Theorem 


Proof 


30.6 Definition 
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forr € Randa = (a,---,a,) € R". With these operations, R” becomes a vector space 
over RR. The axioms for a vector space are readily checked. In particular, R’ = R x Ras 
a vector space over R can be viewed as all “vectors whose starting points are the origin 
of the Euclidean plane” in the sense often studied in calculus courses. A 


For any field F, F[x] can be viewed as a vector space over F, where addition of vectors 
is ordinary addition of polynomials in F [x] and scalar multiplication aw of an element 
of F[x] by an element of F is ordinary multiplication in F[x]. The axioms Y through 
Yz for a vector space then follow immediately from the fact that F[x] is a ring with 
unity. A 


Let E be an extension field of a field F. Then E can be regarded as a vector space over 
F,, where addition of vectors is the usual addition in E and scalar multiplication aq is 
the usual field multiplication in E with a € F and a € E. The axioms follow at once 
from the field axioms for E. Here our field of scalars is actually a subset of our space of 
vectors. It is this example that is the important one for us. A 


We are assuming nothing about vector spaces from previous work and shall prove 
everything we need from the definition, even though the results may be familiar from 
calculus. 


If V is a vector space over F, then Ow = 0, a0 = 0 and (—a)a = a(—a) = —(aqa) for 
alae FandaweV. 


The equation 0a = 0 is to be read “(0-scalar)a = 0-vector.” Likewise, a0 = 0 is to be 
read “a(0-vector) = 0-vector.” The proofs here are very similar to those in Theorem 18.8 
for a ring and again depend heavily on the distributive laws FY, and Z,. Now 


(Oa) = (0 + O)a = Ow) + (Or) 


is an equation in the abelian group (V, +), so by the group cancellation law, 0 = Ow. 
Likewise, from 


a0 =a(0 +0) = a0 +4 a0, 
we conclude that a0 = 0. Then 
0=0a = (a+ (-a)a = aa + (—a)a, 
so (—a)a = —(aa). Likewise, from 
0=a0=a(a +(-a@)) = aa +a(—a@), 


we conclude that a(—a) = —(aq) also. Sd 


Linear Independence and Bases 


Let V be a vector space over F. The vectors in a subset S = {a; |i € J} of V span (or 
generate) V if for every 8 € V, we have 


B= a0, + an0i, $+++ + andi, 


for some a; € F anda;, € S,j =1,--+,n. A vector Dee, aj;q;, is a linear combina- 
tion of the a, ,. | 


30.7 Example 


30.8 Example 


30.9 Definition 


30.10 Example 


30.11 Example 


30.12 Definition 


30.13 Example 


30.14 Example 


Section 30 Vector Spaces 277 


In the vector space IR” over R of Example 30.2, the vectors 
(1,0,+++, 0), (0, 1,+++,0), +++, (0,0,-+-. 1) 
clearly span R", for 
(41, 42,°°+,4) = 4 (1,0,---,0)+a00,1,---,0)+---+a,(0,0,---. 1). 


Also, the monomials x” for m > O span F[x] over F, the vector space of Example 30.3. 


> 


Let F be a field and & an extension field of F. Let a € E be algebraic over F. Then 
F(a) is a vector space over F and by Theorem 30.18, it is spanned by the vectors in 
{1, a, -++,@"—'}, where n = deg(a, F). This iy the important example for us. A 


A vector space V over a field F is finite dimensional if there is a finite subset of V 
whose vectors span V. a 


Example 30.7 shows that R” is finite dimensional. The vector space F'[x] over F is 
not finite dimensional, since polynomials of arbitrarily large degree could not be linear 
combinations of elements of any finite set of polynomials. A 


If F < E anda € E is algebraic over the field F, Example 30.8 shows that F(a) is a 
finite-dimensional vector space over F’. This is the most important example for us. & 


The next definition contains the most important idea in this section. 


The vectors in a subset S = {a; |i € I} of a vector space V over a field F are linearly 
independent over F if, for any distinct vectors ai, E S, coefficients a; € F andn ¢€ Zr, 
we have Dee aja;, = 0 in V only if a; =0 for j = 1,---,n. If the vectors are not 
linearly independent over F’, they are linearly dependent over F. | 


Thus the vectors in {a; |i € 7} are linearly independent over F if the only way the 
0-vector can be expressed as a linear combination of the vectors a; is to have all scalar 
coefficients equal to 0. If the vectors are linearly dependent over F,, then there exist 
a; € F for j =1,---,n such that pat Fj Mi; = 0, where not all a; = 0. 


Observe that the vectors spanning the space R” that are given in Example 30.7 are linearly 
independent over R. Likewise, the vectors in {x” |m > 0} are linearly independent 
vectors of F [x] over F. Note that (1, —1), (2, 1), and (—3, 2) are linearly dependent in 
R? over R, since 


71, -D + @, 1)4+ 3(-3, 2) = (0, 0) = 0. A 


Let E be an extension field of a field F, andleta € E be algebraic over F. If deg(a. F) = 
n, then by Theorem 29.18, every element of F(a) can be uniquely expressed in the form 


bo tbya+t---+ b,-ya" | 


for b; € F. In particular, 0 = 0 + Ow + --- + 0a"! must be a unique such expression 
for 0. Thus the elements 1, a, ---,@”~! are linearly independent vectors in F(a) over 
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30.15 Definition 


30.16 Lemma 


Proof 


30.17 Theorem 


Proof 
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the field F. They also span F(a), so by the next definition, 1, @,---, a"! form a basis 
for F(a) over F. This is the important example for us. In fact, this is the reason we are 
doing this material on vector spaces. A 


If V is a vector space over a field F’, the vectors in a subset B = {6; |i € J} of V form 
a basis for V over F if they span V and are linearly independent. a 


Dimension 


The only other results we wish to prove about vector spaces are that every finite- 
dimensional vector space has a basis, and that any two bases of a finite-dimensional 
vector space have the same number of clements. Both these facts are true without the 
assumption that the vector space is finite dimensional, but the proofs require more knowl- 
edge of set theory than we are assuming, and the finite-dimensional case is all we need. 
First we give an easy lemma. 


Let V be a vector space over a field F, and leta € V. If o is a linear combination of 
vectors 8; in V fori = 1,---,m and each 8; is a linear combination of vectors y; in V 
for j =1,-+-,n, then @ is a linear combination of the y;. 


Leta = > \y_, a; B;, and let Bj = eS bijyvj, where a; and b;; are in F. Then 


m n An m 
a Fa( Sam] =e (Soa) 
jel i=l 


i=l j=l 


and (> 7-4 ajbij) € F. ¢ 


Ina finite-dimensional vector space, every finite set of vectors spanning the space contains 
a subset that is a basis. 


Let V be finite dimensional over F’, and let vectors a, +++, Qn in V span V. Let us list 
the a; in a row. Examine each a; in succession, starting at the left with i = 1, and discard 
the first w; that is some linear combination of the preceding «; fori < j. Then continue, 
starting with the following @ j+1, and discard the next a; that is some linear combination 
of its remaining predecessors, and so on. When we reach of after a finite number of 
steps, those a; remaining in our list are such that none is a linear combination of the 
preceding ; in this reduced list. Lemma 30.16 shows that any vector that is a linear 
combination of the original collection of a; is still a Linear combination of our reduced, 
and possibly smaller, set in which no a; is a linear combination of its predecessors. Thus 
the vectors in the reduced set of a; again span V. 
For the reduced set, suppose that 


Ayd;, Hee +a; = 0 


for iy <in <-+++ <i, and that some a; #0. We may assume from Theorem 30.5 
that a, #0, or we could drop a,a;, from the left side of the equation. Then, using 


30.18 Corollary 
Proof 


30.19 Theorem 


Proof 
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Theorem 30.5 again, we obtain 


ay ay} 
Oe ee Oa i Qi,» 
a, a; 


which shows that a;, is a linear combination of its predecessors, contradicting our con- 
struction. Thus the vectors «; in the reduced set both span V and are linearly independent, 
so they form a basis for V over F. ° 


A finite-dimensional vector space has a finite basis. 


By definition, a finite-dimensional vector space has a finite set of vectors that span the 
space. Theorem 30.17 completes the proof. Sd 


The next theorem is the culmination of our work on vector spaces. 


Let S = {a, ---, a, } bea finite set of linearly independent vectors of a finite-dimensional 
vector space V overa field F. Then S can be enlarged to a basis for V over F. Furthermore, 
if B = {B,,---, B,} is any basis for V over F, thenr <n. 


By Corollary 30.18, there is a basis B = {6,,---, Bn} for V over F. Consider the finite 
sequence of vectors 


Oy, A, Bi, e+, Bp. 


These vectors span V, since B is a basis. Following the technique, used in Theorem 30.17, 
of discarding in turn each vector that is a linear combination of its remaining predecessors, 
working from left to right, we arrive at a basis for V. Observe that no @; is cast out, since 
the a; are linearly independent. Thus S can be enlarged to a basis for V over F. 

For the second part of the conclusion, consider the sequence 


1, Bi, +++, Bn. 
These vectors are not linearly independent over F, because a is a linear combination 
a = bj Bi +--+ + bnBn, 
since the 8; form a basis. Thus 
ay + (—b) Bi + +++ + (—bp) Bn = O. 


The vectors in the sequence do span V, and if we form a basis by the technique of 
working from left to right and casting out in turn each vector that is a linear combination 
of its remaining predecessors, at least one 6; must be cast out, giving a basis 


{ax, pe. a JB his 


where m <n — 1. Applying the same technique to the sequence of vectors 
1 1 
01, a2, Bi Tse Be. 


we atrive at a new basis 


{a1, a, pare aa 8 


TO 
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30.20 Corollary 


Proof 


30.21 Definition 


30.22 Example 


30.23 Theorem 


Proof 
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with s < n — 2. Continuing, we arrive finally at a basis 


Cer er a 


where 0 <t <n—r.Thusr <n. 5 


Any two bases of a finite-dimensional vector space V over F have the same number of 
elements. 


Let B = {Bi,---, Bn} and B’ = {Bi. +++. By} be two bases. Then by Theorem 30.19, 
regarding B as an independent set of vectors and B’ as a basis, we see thatn <m. A 
symmetric argument gives m <n, som =n. ° 


If V is a finite-dimensional vector space Over a field F , the number of elements in a basis 
(independent of the choice of basis, as just shown) is the dimension of V over F. 


Let E be an extension field of a field F, and leta € E. Example 30.14 shows that if a is 
algebraic over F and deg(a, F) =n, then the dimension of F(a) as a vector space over 
F isn. This is the important example for us. A 


An Application to Field Theory 


We collect the results of field theory contained in Examples 30.4, 30.8, 30.11, 30.14, and 
30.22, and incorporate them into one theorem. The last sentence of this theorem gives 
an additional nice application of these vector space ideas to field theory. 


Let E be anextension field of F, andletw € E be algebraic over F. Ifdeg(a, F) =n, then 
F(a) is an n-dimensional vector space over F with basis {1, @,---, a”), Furthermore, 
every element 6 of F(a) is algebraic over F, and deg(f, F) < deg(q, F). 


We have shown everything in the preceding examples except the very important result 
stated in the last sentence of the above theorem. Let 6 € F(a), where a is algebraic over 
F of degree n. Consider the elements 


ieee eee ee 
These cannot be n + 1 distinct elements of F(a) that are linearly independent over F, 
for by Theorem 30.19, any basis of F(a) over F would have to contain at least as many 
elements as are in any set of linearly independent vectors over F. However, the basis 
{1,a,---,@”~!} has just 2 elements. If gi = B/, then B' — B/ = 0, so in any case there 
exist b; € F such that 


bo + biB + bop? +--+ + dnb” =0, 


where not all b; = 0. Then f(x) = Dax” te + bx + bg is a nonzero element of F[x] 
such that f(6) = 0. Therefore, B is algebraic over F and deg(8, F’) is at most n. Sd 


i EXERCISES 30 


1. Find three bases for R? over R, no two of which have a vector in common. 


In Exercises 2 and 3, determine whether the given set of vectors is a basis for R* over R. 
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2. {(, 1, 0), C1, 0, 1), ©, 1, 1D} 3. {(-1, 1, 2), (2, —3, 1), (10, —14, 0)} 
In Exercises 4 through 9, give a basis for the indicated vector space over the field. 
4. QU/2) over Q 5. RJ/2) over R 
6. QG/2) over Q 7. CoverR 
8. QG) over Q 9. Q(/2) over Q 
10. According to Theorem 30.23, the element 1+ a@ of Z,(a) of Example 29.19 is algebraic over Zy. Find the 
irreducible polynomial for 1 + @ in Zo[x]. 
Concepts 


In Exercises 11 through 14, correct the definition of the italicized term without reference to the text, if correction 
is needed, so that it is in a form acceptable for publication. 


11. 


12. 


13. 


14. 
15. 


The vectors in a subset S of a vector space V over a field F span V if and only if each B € V can be expressed 
uniquely as a linear combination of the vectors in S. 


The vectors in a subset S$ of a vector space V over a field F are linearly independent over F if and only if the 
zero vector cannot be expressed as a linear combination of vectors in S. 


The dimension over F of a finite-dimensional vector space V over a field F is the minimum number of vectors 
required to span V. 


A basis for a vector space V over a field F is a set of vectors in V that span V and are linearly dependent. 
Mark each of the following true or false. 


a. The sum of two vectors is a vector. 
The sum of two scalars is a vector. 


The product of two scalars is a scalar. 
The product of a scalar and a vector is a vector. 


The vectors in a basis are linearly dependent. 
The 0-vector may be part of a basis. 
If F < E anda € E is algebraic over the field F, then a? is algebraic over F. 
i, If F < E anda ¢ E is algebraic over the field F, then a + a? is algebraic over F. 


db. 
c. 
d. 
e. Every vector space has a finite basis. 
f. 
g. 
h. 


j. Every vector space has a basis. 


The exercises that follow deal with the further study of vector spaces. In many cases, we are asked to define for 
vector spaces some concept that is analogous to one we have studied for other algebraic structures. These exercises 
should improve our ability to recognize parallel and related situations in algebra. Any of these exercises may assume 
knowledge of concepts defined in the preceding exercises. 


16. 


17. 


18. 


Let V be a vector space over a field F. 

a. Define a subspace of the vector space V over F. 

b. Prove that an intersection of subspaces of V is again a subspace of V over F. 

Let V be a vector space over a field F’, and let § = {a; |i € J} be a nonempty collection of vectors in V. 
a. Using Exercise 16(b), define the subspace of V generated by S. 


b. Prove that the vectors in the subspace of V generated by S are precisely the (finite) linear combinations of 
vectors in S. (Compare with Theorem 7.6.) 


Let V;,---, V, be vector spaces over the same field F. Define the direct sum V; @ --+ ® V,, of the vectors 
spaces V; fori = 1,---,n, and show that the direct sum is again a vector space over F. 


aa... eee E_:|:YS- 
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19. Generalize Example 30.2 to obtain the vector space F" of ordered n-tuples of elements of F over the field F, 
for any field F. What is a basis for F"? 


20. Define an isomorphism of a vector space V over a field F with a vector space V’ over the same field F’. 


Theory 
21. Prove that if V is a finite-dimensional vector space over a field F, then a subset {6;, B2,---» Bn} of V is a basis 
for V over F if and only if every vector in V can be expressed uniquely as a linear combination of the fj. 


22. Let F be any field. Consider the “system of m simultaneous linear equations inn unknowns” 
aX, + ayXo t+ FainXn =r, 
ay X1 + Gog X2 t+ + GanXn = br, 


Ami X1 + Am2X2 +--+ + Ann Xn = Dns 
where a;;, 5; € F. 


a. Show that the “system has a solution,” that is, there exist Xj,---, Xn € F that satisfy all m equations, if 
and only if the vector 8 = (by,++*) bm) of F” is a linear combination of the vectors aj; = (Qijrvots mj): 
(This result is straightforward to prove, being practically the definition of a solution, but should really be 
regarded as the fundamental existence theorem for a simultaneous solution of a system of linear equations.) 
b. From part (a), show that ifn =mand{o;|j=l-: _n} is a basis for F”, then the system always has a 
unique solution. 
23. Prove that every finite-dimensional vector space V of dimension n over a field F is isomorphic to the vector 
space F” of Exercise 19. 


24. Let V and V’ be vector spaces over the same field F. A function @ : V > V’ is alinear transformation of V 
into V’ if the following conditions are satisfied for alla, B € V anda € F: 


g(a + B) = o(a) + $(). 
plaw) = a(b(@)). 
a. If (6; |i € I} isa basis for V over F, show that a linear transformation @ : V > V’is completely determined 
by the vectors (Bi) € Vv’. 
b. Let {B; |i € Z} be a basis for V, and let {f;’ |i € 7} be any set of vectors, not necessarily distinct, of Vv’. 
Show that there exists exactly one linear transformation ¢@ : V > V’ such that o(B:) = Bi’. 
25. Let V and V’ be vector spaces over the same field F, and let @ : V > V’ bea linear transformation. 
a. To what concept that we have studied for the algebraic structures of groups and rings does the concept ofa 
linear transformation correspond? 
b. Define the kernel (or nullspace) of ¢, and show that it is a subspace of V. 
c. Describe when ¢ is an isomorphism of V with V’. 
26. Let V be a vector space over a field F, and let S be a subspace of V. Define the quotient space V/S, and show 
that it is a vector space over F’. 
27. Let V and V’ be vector spaces over the same field F, and let V be finite dimensional over F. Let dim(V) be 
the dimension of the vector space V over F. Let @: V > V’ bea linear transformation. 
a. Show that o[V] is a subspace of Vv’. 


b. Show that dim(@[V]) = dim(V) — dim(Ker(#)). [Hint: Choose a convenient basis for V, using Theo- 
rem 30.19. For example, enlarge a basis for Ker (#) to a basis for V.] 


31.1 Definition 


31.2 Definition 


31.3 Theorem 
Proof 


31.4 Theorem 
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ALGEBRAIC EXTENSIONS 


Finite Extensions 


In Theorem 30.23 we saw that if E is an extension field of a field F anda € E is algebraic 
over F, then every element of F(a) is algebraic over F’. In studying zeros of polynomials 
in F[x], we shall be interested almost exclusively in extensions of F containing only 
elements algebraic over F. 


An extension field FE’ of a field F is an algebraic extension of F if every element in E 
is algebraic over F. a 


If an extension field E of a field F is of finite dimension n as a vector space over F, then 
E is a finite extension of degree nm over F’. We shall let [E : F'] be the degree n of E 
over F. B 


To say that a field E is a finite extension of a field F does not mean that E is finite 
field. It just asserts that E is a finite-dimensional vector space over F, thatis, that [E : F'] 
is finite. 

We shall often use the fact that if F is a finite extension of F, then, [E : F] = 1 if 
and only if E = F. We need only observe that by Theorem 30.19, {1} can always be 
enlarged to a basis for E over F. Thus [E : F] = 1 if and only if F = F() = F. 

Let us repeat the argument of Theorem 30.23 to show that a finite extension FE of a 
field F must be an algebraic extension of F. 


A finite extension field £ of a field F is an algebraic extension of F’. 


We must show that fora € E, a is algebraic over F. By Theorem 30.19 if [EZ : F] =n, 
then 


cannot be linearly independent elements, so there exist a; € F such that 
Ane” +-+++ aya +ag = 0, 


and notalla; = 0.Then f(x) = a,x" + ---+ a,x + ag is anonzero polynomial in F [x]. 
and f(a) = 0. Therefore, a is algebraic over F. 4 


We cannot overemphasize the importance of our next theorem. It plays a role in 
field theory analogous to the role of the theorem of Lagrange in group theory. While its 
proof follows easily from our brief work with vector spaces, it is a tool of incredible 
power. An elegant application of it in the section that follows shows the impossibility 
of performing certain geometric constructions with a straightedge and a compass. Never 
underestimate a theorem that counts something. 


If E is a finite extension field of a field F, and K is a finite extension field of E, then K 
is a finite extension of fF, and 


[K: F]=[K: EVE: F]. 
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K 
Basis {Bj} 
Basis {oj6j}) FE 
Basis {a} 
31.5 Figure 
Proof Let {a;|i =1,--+,n} be a basis for E as a vector space over F, and let the set 
{6;|j =1,---,m} be a basis for K as a vector space over E. The theorem will be 


proved if we can show that the mn elements a; 6; form a basis for K, viewed as a vector 
space over F. (See Fig. 31.5.) 
Let y be any element of K. Since the 8; forma basis for K over E, we have 


m 
v = > dB; 
j=l 
for b; € E. Since the a; form a basis for E over F, we have 


n 
bj = ) Ajj Oj 
i=l 


for a;; € &. Then 


m n 
y= > (Soom) = > aij(@iB;), 
joi \isl i,j 
so the mn vectors a; 8; span K over F. 
Itremains for us to show that the mn elements a; 8; are independent over F’. Suppose 


that Di, jCij Gi B;) = 0, with Ciy E F. Then 


yy (Soave) = 0, 
=e 


j=l 
and (E"_,cjja;) € E. Since the elements 8; are independent over £, we must have 


n 
) CijQi = 0 
i=l 


for all j. But now the a; are independent over F’, so ye) cj = O implies that c;; = 0 
for all i and j. Thus the @;6; not only span K over F but also are independent over F. 
Thus they form a basis for K over F. . 4 


Note that we proved this theorem by actually exhibiting a basis. It is worth remem- 
bering that if {@; |¢ = 1,---, n}is a basis for E over F and {f;|j =1,---, m} is a basis 
for K over E, for fields F < EF < K, then the set {a; 8 j} of mn products is a basis for 
K over F. Figure 31.5 gives a diagram for this situation. We shall illustrate this further 
in a moment. 


31.6 Corollary 


Proof 


31.7 Corollary 


Proof 


31.8 Example 


31.9 Example 


31.10 Example 
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If F; is a field fori = 1,---,r and F;4, is a finite extension of F;, then F, is a finite 
extension of F,, and 


[Fe PI= Ce: B-ill-1: F-2]--- [Fo : Ai). 
The proof is a straightforward extension of Theorem 31.4 by induction. 5 


If E is an extension field of F, a € E is algebraic over F, and B € F(a), thendeg(f, F) 
divides deg(a, F). 


By Theorem 30.23, deg(a, F) = [F(a) : F]and deg(6, F) = [F(): F]. We have F < 
F(B) < F(a), so by Theorem 31.4 [F (8) : F] divides [F(@): F]. 5 


The following example illustrates a type of argument one often makes using Theo- 
rem 31.4 or its corollaries. 


By Corollary 31.7, there is no element of Q(/2) that is a zero of x? — 2. Note that 
deg(/2 , Q) = 2, while a zero of x3 — 2is of degree 3 over Q, but 3 does notdivide2. A 


Let E be an extension field of a field F, and let a,, a2 be elements of £, not 
necessarily algebraic over F. By definition, F(a) is the smallest extension field of 
F in E that contains a. Similarly, (F(@,))(@2) can be characterized as the smallest 
extension field of F in E containing both a, and a. We could equally have started 
with a2, so (F(@1))(a@2) = (F(@2))(a1). We denote this field by F(a, a2). Similarly, for 
a; € E, F(a, --+ op) is the smallest extension field of F in E containing all the o; for 
i =1,---,n. We obtain the field F(a, ---, @,) from the field F by adjoining to F the 
elements a; in E. Exercise 49 of Section 18 shows that, analogous to an intersection 
of subgroups of a group, an intersection of subfields of a field E is again a subfield of 
E. Thus F(a, ---,@,) can be characterized as the intersection of all subfields of E 
containing F and all the a; fori = 1,---,n. 


Consider Q(./2). Theorem 30.23 shows that {1, /2} is a basis for Q(/2) over Q. Using 
the technique demonstrated in Example 29.10, we can easily discover that JV2+3 is 
a zero of x* — 10x? + 1. By the method demonstrated in Example 23.14, we can show 
that this polynomial is irreducible in Q[x]. Thus inr(/2 + V3, Q) = x4 — 10x? + 1, 
so [Q(V2 + V3) : Q] = 4. Thus (V2 + V3) ¢ Q(/2), so V3 ¢ Q(Y2). Consequently, 
{1, /3} is a basis for Q(V2, V3) = (Q(/2))(/3) over Q(./2). The proof of Theo- 
rem 31.4 (see the comment following the theorem) then shows that {1, /2, /3, V6} is 
a basis for Q(/2, V3) over Q. A 


Let 2!/3 be the real cube root of 2 and 2!/? be the positive square root of 2. Then 
21/2 ¢ Q(2"/3) because deg(2!/?, Q) = 2 and 2 is not a divisor of 3 = deg(2'/7, Q). 
Thus [Q(2!/3, 23/2) : Q(2!/9)] = 2. Hence {1, 21/9, 27/7} is a basis for Q(21/7) over Q 
and {1, 2'/} is a basis for Q(2!/3, 21/2) over Q(2'/*). Furthermore, by Theorem 31.4 
(see the comment following the theorem), 


{1, 24/2, 21/3 25/6 92/3 97/6 


is a basis for Q(2!/2, 21/3) over Q. Because 27/6 = 2(2!/6), we have 2!/6 « Q(2!/7, 21/3). 
Now 21/6 is a zero of x° — 2, which is irreducible over Q, by Eisenstein’s criterion, with 
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31.11 Theorem 


Proof 


31.12 Theorem 


Proof 
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p = 2. Thus 
@ <Q) < Q2?, 2") 
and by Theorem 31.4 
6 = (Q2"?, 2"): QI = [Q(2!?, 24) : Q2”)Q2"*) : QI 
= 1927, 24) : Q2"*)16). 
Therefore, we must have 


(Q(2¥/7, 24/3) : Qc2/*)] = 1, 


so Q(2'/2, 21/3) = Q(2), by the comment preceding Theorem 31.3. A 
Example 31.10 shows that it is possible for an extension F(a, ---, @,) ofa field F 
to be actually a simple extension, even though nv > 1. 
Let us characterize extensions of F of the form F(a, ---, a) in the case that all 


the @; are algebraic over F. 


Let E be an algebraic extension of a field F. Then there exist a finite number of elements 
Qi,°-:,@, in E such that E = F(a,,---,a,) if and only if £ is a finite-dimensional 
vector space over F, that is, if and only if £ is a finite extension of F. 

Suppose that FE = F(ai,---,@,). Since E is an algebraic extension of F, each a;, is 
algebraic over F, so each a; is algebraic over every extension field of F in E. Thus F(a) 
is algebraic over F, and in general, F(a, ---, a,) is algebraic over F(a, ---, @;-1) for 
j =2,:---,n. Corollary 31.6 applied to the sequence of finite extensions 


F, F(o), F(a, 02), -+-, Flo, +--+, On) = E 


then shows that F is a finite extension of F. 

Conversely, suppose that FE is a finite algebraic extension of F. If [E : F] = 1, 
then E = F(1) = F, and we are done. If E # F, let a, € E, where a, ¢ F. Then 
[F(a,): F] > 1. If F(a,) = E, we are done; if not, let a. ¢ E, where a2 ¢ F(a). 
Continuing this process, we see from Theorem 31.4 that since [E : F'] is finite, we must 
arrive at w, such that 


F(aq,-++, Qn) = E. 5 


Algebraically Closed Fields and Algebraic Closures 


We have not yet observed that if E is an extension of a field F anda, 6 € E are algebraic 
over F,, thensoarea + 6, eB, a — B,anda/B, if B 4 0. This follows from Theorem 31.3 
and is also included in the following theorem. 


Let E be an extension field of F. Then 
F, = {a € E|a is algebraic over F} 
is a subfield of E, the algebraic closure of F in E. 


Leta, B € F,. Then Theorem 31.11 shows that F(a, 8) isa finite extension of F, and by 
Theorem 31.3 every element of F(a, 8) is algebraic over F, thatis, F(a, B) © Fr. Thus 


31.13 Corollary 
Proof 


31.14 Definition 


31.15 Theorem 


Proof 


31.16 Corollary 


Proof 
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Fy contains a + B, #8, a — B, and also contains w/f for 6 4 0, so Fz is a subfield of 
E, ¢ 


The set of all algebraic numbers forms a field. 


Proof of this corollary is immediate from Theorem 31.12, because the set of all algebraic 
numbers is the algebraic closure of Q in C. a 


It is well known that the complex numbers have the property that every nonconstant 
polynomial in C[x] has a zero in C. This is known as the Fundamental Theorem of 
Algebra. An analytic proof of this theorem is given in Theorem 31.18. We now give a 
definition generalizing this important concept to other fields. 


A field F is algebraically closed if every nonconstant polynomial in F[x] has a zero 
in F. a 


Note that a field F can be the algebraic closure of F in an extension field E without 
F being algebraically closed. For example, Q is the algebraic closure of Q in Q(x), but 
Q is not algebraically closed because x” + 1 has no zero in Q. 

The next theorem shows that the concept of a field being algebraically closed can 
also be defined in terms of factorization of polynomials over the field. 


A field F is algebraically closed if and only if every nonconstant polynomial in F[x] 
factors in F'[x] into linear factors. 


Let F be algebraically closed, and let f(x) be a nonconstant polynomial in F[x] 
The#i f(x) has a zero a € F. By Corollary 23.3, x — a is a factor of f(x), so f(x) = 
(x — a)g(x). Then if g(x) is nonconstant, it has a zero b € F, and we have f(x) = 
(x — a)(x — b)h(x). Continuing, we get a factorization of f(x) in F[x] into linear fac- 
tors. 

Conversely, suppose that every nonconstant polynomial of F'[x] has a factorization 
into linear factors. If ax — b is a linear factor of f(x), then b/a is a zero of f(x). Thus 
F is algebraically closed. 


An algebraically closed field F has no proper algebraic extensions, that is, no algebraic 
extensions E with F < E. 


Let E be an algebraic extension of F, so F < E. Then if a € E, we have irr(a, F) = 
x —a@, by Theorem 31.15, since F is algebraically closed. Thus a ¢ F, and we must 
have F= E. ,S 


In a moment we shall show that just as there exists an algebraically closed extension 
C of the real numbers R, for any field F there exists similarly an algebraic extension F 
of F, with the property that F is algebraically closed. Naively, to find F we proceed as 
follows. If a polynomial f(x) in F[x] has ano zero in F, then adjoin a zero @ of such an 
f(x) to F, thus obtaining the field F(a). Theorem 29.3, Kronecker’s theorem, is strongly 
used here, of course. If F(a) is still not algebraically closed, then continue the process 
further. The trouble is that, contrary to the situation for the algebraic closure C of R, 
we may have to do this a (possibly large) infinite number of times. It can be shown (see 
Exercises 33 and 36) that © is isomorphic to the field of all algebraic numbers, and that 
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31.17 Theorem 


31.18 Theorem 


Proof 


31.19 Definition 


31.20 Example 
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we cannot obtain © from Q by adjoining a finite number of algebraic numbers. We shall 
have to first discuss some set-theoretic machinery, Zorn’s lemma, in order to be able to 
handle such a situation. This machinery is a bit complex, so we are putting the proof 
under a separate heading. The existence theorem for F is very important, and we state 
it here so that we will know this fact, even if we do not study the proof. 


Every field F has an algebraic closure, that is, an algebraic extension F that is alge- 
braically closed. 


It is well known that C is an algebraically closed field. We recall an analytic proof for 
the student who has had a course in functions of a complex variable. There are algebraic 
proofs, but they are much longer. 


(Fundamental Theorem of Algebra) The field C of complex numbers is an alge- 
braically closed field. 


Let the polynomial f(z) € C[z] have no zero in C. Then 1/f(z) gives an entire function; 
that is, 1/f is analytic everywhere. Also if f ¢ C, limy)+00 | f(z)| = &, 80 lim). 
[1/f(2)| = 0. Thus 1/f must be bounded in the plane. Hence by Liouville’s theorem of 
complex function theory, 1/f is constant, and thus f is constant. Therefore, a nonconstant 
polynomial in C[z] must have a zero in C, so C is algebraically closed. ¢ 


Proof of the Existence of an Algebraic Closure 


We shall prove that every field has an algebraic extension that is algebraically closed. 
Mathematics students should have the opportunity to see some proof involving the Axiom 
of Choice by the time they finish college. This is a natural place for such a proof. We 
shall use an equivalent form, Zorn’s lemma, of the Axiom of Choice. To state Zorn’s 
lemma, we have to give a set-theoretic definition. 


A partial ordering of a set S is given by a relation < defined for certain ordered pairs 
of elements of S such that the following conditions are satisfied: 


1. a<aforalla é S (reflexive law). 
2. Ifa <bandb <a, thena = d (antisymmetric law). 


3. Ifa <b andb <c, thena < c (transitive law). | 


In a partially ordered set, not every two elements need be comparable; that is, for 
a,b € S, we need not have either a < b or b < a. As usual, a < b denotes a < b but 
ax#b. 

A subset T of a partially ordered set S is a chain if every two elements a and b 
in T are comparable, that is, either a < b or b < a (or both). An element u € S is an 
upper bound for a subset A of partially ordered set Sif a < u for alla ¢€ A. Finally, an 
element m of a partially ordered set S is maximal if there is no s € S such that m < s. 


The collection of all subsets of a set forms a partially ordered set under the relation < 
given by C. For example, if the whole set is R, we have Z C Q. Note, however, that for 
Z and Qt, neither Z C Qt nor QT CZ. A 
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31.21 Zorn’s Lemma _If S is a partially ordered set such that every chain in S has an upper bound in S, then 
S has at least one maximal element. 


There is no question of proving Zorn’s lemma. The lemma is equivalent to the Axiom of 
Choice. Thus we are really taking Zorn’s lemma here as an axiom for our set theory. Refer 
to the literature for a statement of the Axiom of Choice and a proof of its equivalence to 
Zorn’s lemma. (See Edgerton [47].) 

Zorn’s lemma is often useful when we want to show the existence of a largest 
or maximal structure of some kind. If a field F has an algebraic extension F that is 
algebraically closed, then F will certainly be a maximal algebraic extension of F, for 
since F is algebraically closed, it can have no proper algebraic extensions. 

The idea of our proof of Theorem 31.17 is very simple. Given a field F, we shall 
first describe a class of algebraic extensions of F that is so large that it must contain 
(up to isomorphism) any conceivable algebraic extension of F. We then define a partial 
ordering, the ordinary subfield ordering, on this class, and show that the hypotheses 
of Zorn’s lemma are satisfied. By Zorn’s lemma, there will exist a maximal algebraic 
extension F of F in this class. We shall then argue that, as a maximal element, this 
extension F can have no proper algebraic extensions, so it must be algebraically closed. 

Our proof differs a bit from the one found in many texts. We like it because it uses 
no algebra other than that derived from Theorems 29.3 and 31.4. Thus it throws into 
sharp relief the tremendous strength of both Kronecker’s theorem and Zorn’s lemma. 
The proof looks long, but only because we are writing out every little step. To the 
professional mathematician, the construction of the proof from the information in the 
preceding paragraph is a routine matter. This proof was suggested to the author during 
his graduate student days by a fellow graduate student, Norman Shapiro, who also had 
a strong preference for it. 


mw HistoricaL NoTEe 


he Axiom of Choice, although used implicitly 

in the 1870s and 1880s, was first stated explic- 
itly by Emst Zermelo in 1904 in connection with 
his proof of the well-ordering theorem, the result 
that for any set A, there exists an order—relation < 
such that every nonempty subset B of A contains 
a least element with respect to <. Zermelo’s Ax- 
iom of Choice asserted that, given any set M and 
the set S of all subsets of 7, there always exists 
a “choice” function, a function f : S > M such 
that f(M’) € M’ for every M’ in S. Zermelo noted, 
in fact, that “this logical principal cannot... be re- 
duced to a still simpler one, but it is applied with- 
out hesitation everywhere in mathematical deduc- 
tion.” A few years later he included this axiom in 
his collection of axioms for set theory, a collection 


which was slightly modified in 1930 into what is 
now called Zermelo—Fraenkel set theory, the axiom 
system generally used today as a basis of that theory. 

Zom’s lemma was introduced by Max Zorn 
(1906-1993) in 1935. Although he realized that it 
was equivalent to the well-ordering theorem (itself 
equivalent to the Axiom of Choice), he claimed 
that his lemma was more natural to use in alge- 
bra because the well-ordering theorem was some- 
how a “transcendental” principal. Other mathemati- 
cians soon agreed with his reasoning. The lemma 
appeared in 1939 in the first volume of Nicolas 
Bourbaki’s Eléments de Mathématique: Les Struc- 
tures Fondamentales de I’Analyse. It was used con- 
sistently in that work and quickly became an essen- 
tial part of the mathematician’s toolbox. 
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We are now ready to carry out our proof of Theorem 31.17, which we restate here. 


31.22 Restated Theorem 31.17 Every field F has an algebraic closure F’. 


Proof 


It can be shown in set theory that given any set, there exists a set with strictly more 
elements. Suppose we form a set 


A= {wszil f € Flx);i =0,---, (degree f)} 


that has an element for every possible zero of any f(x) € F [x]. Let Q bea set with strictly 
more elements than A. Replacing Q by QU F if necessary, we can assume F C Q. 
Consider all possible fields that are algebraic extension of F and that, as sets, consist of 
elements of £2. One such algebraic extension is F itself. If E is any extension field of F, 
and if y € Eisazero f(x) € F[x] for y ¢ F and deg(y, F) =n, then renaming y by 
w for w € Q and w ¢ F, and renaming elements dp + ayy +---+ a,_-1y" | of Fly) 
by distinct elements of & as the a; range over F, we can consider our renamed F(y) 
to be an algebraic extension field F(w) of F, with F(w) C Q and f(w) = 0. The set Q 
has enough elements to form F(w), since Q has more than enough elements to provide 
n different zeros for each element of each degree n in any subset of F'[x]. 
All algebraic extension fields E; of F, with E; C Q, form a set 


S={E,li¢J) 


that is partially ordered under our usual subfield inclusion <. One element of S is F 
itself. The preceding paragraphs shows that if F is far away from being algebraically 
closed, there will be many ficlds £; in S. 

Let T = {£j,} be achain in S, andlet W = U,£;,. We now make W into a field. Let 
a, B € W. Then there exist E;,, £;, € 8, witha ¢ Ej, and B € Ej,. Since T is a chain, 
one of the fields E;, and E,, is a subfield of the other, say Ej, < Ej,. Thena, B € Ej, 
and we use the field operations of E,, to define the sum of w and 6 in W as (@ + B) € Ej, 
and, likewise, the product as (a8) € £,,. These operations are well defined in W; they 
are independent of our choice of Ej,, since if w, 8 € E;, also, for Ej, in T, then one 
of the fields E;, and £,, is a subfield of the other, since T is a chain. Thus we have 
operations of addition and multiplication defined on W. 

All the field axioms for W under these operations now follow from the fact that 
these operations were defined in terms of addition and multiplication in fields. Thus, for 
example, 1 € F serves as multiplicative identity in W, since fora ¢ W, if 1,a € E,, 
then we have lw = a in E;,, so lw = @ in W, by definition of multiplication in W. As 
further illustration, to check the distributive laws, let a, 8B, y € W. Since T is a chain, 
we can find one field in T containing all three elements a, 6, and y, and in this field the 
distributive laws for a, 6, and y hold. Thus they hold in W. Therefore, we can view W 
as a field, and by construction, E,, < W forevery F;, € T. 

If we can show that W is algebraic over F’, then W € S will be an upper bound for 
T. Butifa € W, thena e E,, for some E,, in T, so a is algebraic over F. Hence W is 
an algebraic extension of F and is an upper bound for T. 

The hypotheses of Zorn’s lemma are thus fulfilled, so there is a maximal element 
F of S. We claim that F is algebraically closed. Let f(x) € F[x], where f(x) ¢ F. 
Suppose that f(x) has no zero in F. Since Q has many more elements than F has, we 
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can take wm € Q, where w ¢ F, and form a field F(w) € Q, with w a zero of f(x), as we 
saw in the first paragraph of this proof. Let 6 be in F(w). Then by Theorem 30.23, 6 is 
a zero of a polynomial 


B(x) = Ap +x +e + Oy_x"” 
in F[x], with a; € F, and hence a; algebraic over F. Then by Theorem 31.11 the field 


F(a, +++, @,) is a finite extension of F, and since f is algebraic over F(a, ---, @n), We 
also see that F(ag,---, a, 8) is a finite extension over F(a, ---,a@,). Theorem 31.4 
then shows that F(ao,-+-, pn, 8) is a finite extension of F’, so by Theorem 31.3, 8 is 


algebraic over F. Hence F(w) € S and F < F(w), which contradicts the choice of F as 
maximal in S. Thus f(x) must have had a zero in F, so F is algebraically closed. @ 


The mechanics of the preceding proof are routine to the professional mathematician. 
Since it may be the first proof that we have ever seen using Zorn’s lemma, we wrote the 
proof out in detail. 


@ EXERCISES 31 


Computations 
In Exercises 1 through 13, find the degree and a basis for the given field extension. Be prepared to justify your 
answers. 
L. Q(/2) over Q : 2. Q/2, V3) over @ 
3. Q(/2, V3, 18) over Q 4. Q(/2, 73) over @ 
5. QC/2, 72) over Q 6. Q(V2 + V3) over Q 
7. QG/2V3) over Q 8. QCV2, W/5) over Q 
9. OW/2, 6, 24) over Q 10. Q(V2, V6) over Q(V/3) 
11. Q(V2 + V3) over Q(V3) 12. Q(/2, V3) over Q(V2 + V3) 


13. Q(/2, V6 + V10) over Q(V3 + V5) 


Concepts 


In Exercises 14 through 17, correct the definition of the italicized term without reference to the text, if correction 
is needed, so that it is in a form acceptable for publication. 


14. An algebraic extension of a field F is a field F(a), a2, +++, &,) where each a; is a zero of some polynomial in 


F[x]. 


15. A finite extension field of a field F is one that can be obtained by adjoining a finite number of elements to F. 


16. The algebraic closure Fg of a field F in an extension field E of F is the field consisting of all elements of E 
that are algebraic over F. 


17. A field F is algebraically closed if and only if every polynomial has a zero in F. 


18. Show by an example that for a proper extension field E of a field F, the algebraic closure of F in E need not 
be algebraically closed. 
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Mark each of the following true or false. 


a. Ifa field £ is a finite extension of a field F, then £ is a finite field. 

b. Every finite extension of a field is an algebraic extension. 

c. Every algebraic extension of a field is a finite extension. 

d. The top field of a finite tower of finite extensions of fields is a finite extension of the bottom field. 
e. Q is its own algebraic closure in R, that is Q is algebraically closed in R. 

f. C is algebraically closed in C(x), where x is an indeterminate. 

g. C(x) is algebraically closed, where x is an indeterminate. 

h. The field C(x) has no algebraic closure, since C already contains all algebraic numbers. 

i. An algebraically closed field must be of characteristic 0. 

j. If E is an algebraically closed extension field of F, then E is an algebraic extension of F. 


Proof Synopsis 


20. 
21. 


Give a one-sentence synopsis of the proof of Theorem 31.3. 


Give a one- or two-sentence synopsis of the proof of Theorem 31.4. 


Theory 


22. 
23. 


24, 
25. 


26. 


27 
28. 


29. 


30. 


31. 


32. 


33. 


Let (a + bi) € C where a, b € Rand b # 0. Show that C = Rta + bi). 


Show that if E is a finite extension of a field F and [E : F] is a prime number, then E is a simple extension of 
F and, indeed, E = F(a) for every a € E notin F. 


Prove that x? — 3 is irreducible over Q(/2). 


What degree field extensions can we obtain by successively adjoining to a field F a square root of an element 
of F not a square in F, then square root of some nonsquare in this new field, and so on? Argue from this 
that a zero of x!4 — 3x? + 12 over Q can never be expressed as a rational function of square roots of rational 
functions of square roots, and so on, of elements of Q. 


Let E be a finite extension field of F. Let D be an integral domain such that F C D C E. Show that Disa 
field. 

Prove in detail that Q(/3 + V7) = Q(V3, V7). 

Generalizing Exercise 27, show that if fa + //b # 0, then Q(/a + Vb) = Q(/a, Vb) for alla and b in Q. 
[Hint: Compute (a — b)/C./a + Vb).] 

Let E be a finite extension of a field F, and let p(x) € F[x] be irreducible over F and have degree that is not 
a divisor of [E : F]. Show that p(x) has no zeros in E. 

Let E be an extension field of F. Let a € E be algebraic of odd degree over F. Show that a? is algebraic of 
odd degree over F, and F(a) = F(a’). 

Show that if F, , and K are fields with F < E < K, then K is algebraic over F if and only if E is algebraic 
over F, and K is algebraic over E. (You must not assume the extensions are finite.) 

Let E be an extension field of a field F. Prove that every a ¢€ E that is not in the algebraic closure Fr of F in 
E is transcendental over Fr. 

Let E be an algebraically closed extension field of a field F. Show that the algebraic closure Fz of F in E is 
algebraically closed. (Applying this exercise to C and Q, we see that the field of all algebraic numbers is an 
algebraically closed field.) 
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34, Show that if Z is an algebraic extension of a field F and contains all zeros in F of every f(x) € F[x]. then E 
is an algebraically closed field. 


35. Show that no finite field of odd characteristic is algebraically closed. (Actually, no finite field of characteristic 2 
is algebraically closed either.) [Hint: By counting, show that for such a finite field F, some polynomial x? — a, 
for some a € F, has no zero in F. See Exercise 32, Section 29.] 


36. Prove that, as asserted in the text, the algebraic closure of Q in C is not a finite extension of Q. 
37. Argue that every finite extension field of R is either R itself or is isomozphic to C. 


38. Use Zorn’s lemma to show that every proper ideal of a ring R with unity is contained in some maximal ideal. 


1 GEOMETRIC CONSTRUCTIONS 


In this section we digress briefly to give an application demonstrating the power of 
Theorem 31.4. For a more detailed study of geometric constructions, you are referred to 
Courant and Robbins [44, Chapter TH]. 

We are interested in what types of figures can be constructed with a compass and 
a straightedge in the sense of classical Euclidean plane geometry. We shall discuss the 
impossibility of trisecting certain angles and other classical questions. 


Constructible Numbers 


Let us imagine that we are given only a single line segment that we shall define to be 
one unit in length. A real number qa is constructible if we can construct a line segment 
of length |@| in a finite number of steps from this given segment of unit length by using 
a straightedge and a compass. 

The rules of the game are pretty strict. We suppose that we are given just two points 
at the moment, the endpoints of our unit line segment, let us suppose that they correspond 
to the points (0, 0) and (1, 0) in the Euclidean plane. We are allowed to draw a line only 
with our straightedge through two points that we have already located. Thus we can start 
by using the straightedge and drawing the line through (0, 0) and (1, 0). We are allowed 
to open our compass only to a distance between points we have already found. Let us 
open our compass to the distance between (0, 0) and (1, 0). We can then place the point 
of the compass at (1, 0) and draw a circle of radius 1, which passes through the point 
(2, 0). Thus we now have located a third point, (2, 0). Continuing in this way, we can 
locate points (3, 0), (4, 0), (1, 0), (2, 0), and so on. Now open the compass the distance 
from (0, 0) to (0, 2), put the point at (1, 0), and draw a circle of radius 2. Do the same with 
the point at (—1, 0). We have now found two new points, where these circles intersect. 
and we can put our straightedge on them to draw what we think of as the y-axis. Then 
opening our compass to the distance from (0, 0) to (1, 0), we draw a circle with center 
at (0, 0) and locate the point (0, 1) where the circle intersects the y-axis. Continuing 
in this fashion, we can locate all points (x, y) with integer coordinates in any rectangle 
containing the point (0, 0). Without going into more detail, it can be shown that it is 
possible, among other things, to erect a perpendicular to a given line at a known point 


1 This chapter is not used in the remainder of the text. 
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on the line, and find a line passing through a known point and parallel to a given line. 
Our first result is the following theorem. 


If w and f are constructible real numbers, then so are w + £, a — 8, af, and a/B, if 


p #0. 


We are given that w and £ are constructible, so there are line segments of lengths || and 
|B| available to us. Fora, B > 0, extend a line segment of length w with the straightedge. 
Start at one end of the original segment of length a, and lay off on the extension the length 
£ with the compass. This constructs a line segment of length a + 8; a — 8 is similiarly 
constructible (see Fig. 32.2). If @ and f are not both positive, an obvious breakdown into 
cases according to their signs shows that a + 8 anda — £ are still constructible. 

The construction of af is indicated in Fig. 32.3. We shall let OA be the line segment 
from the point O to the point A, and shall let |OA| be the length of this line segment. 
If OA is of length ||, construct a line / through O not containing OA. (Perhaps, if O 
is at (0, 0) and A is at (a, 0), you use the line through (0, 0) and (4, 2).) Then find the 
points P and B on/ such that O P is of length 1 and OB is of length |6|. Draw PA and 
construct /’ through B, parallel to PA and intersecting OA extended at Q. By similar 
triangles, we have 


so OO is of length ja]. —_ 

Finally, Fig. 32.4 shows that w/ is constructible if 6 A 0. Let OA be of length |a|, 
and construct / through O not containing OA. Then find B and P on/ such that OB 
is of length |f| and O P is of length 1. Draw BA and construct /’ through P, parallel to 
BA, and intersecting O A at Q. Again by similar triangles, we have 


\701 _ lal 
1 IB|’ 
so OO is of length |a/A|. Co 
a B a 
—_————————— ee | omaneneiiemmmiaaemetiine eames 
ss —_+—_______ 
———————— oo 
a+ sp a— Bp B 
32.2 Figure 


43 


oO 


32.3 Figure 
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32.4 Figure 


32.5 Corollary The set of all constructible real numbers forms a subfield F of the field of real numbers. 
Proof Proof of this corollary is immediate from Theorem 32.1. ° 


Thus the field F of all constructible real numbers contains Q, the field of rational 
numbers, since Q is the smallest subfield of R. 

From now on, we proceed analytically. We can construct any rational number. Re- 
garding our given segment 


| 


of length 1 as the basic unit on an x-axis, we can locate any point (q1, gz) in the plane 
with both coordinates rational. Any further point in the plane that we can locate by using 
a compass and a straightedge can be found in one of the following three ways: 


1. as an intersection of two lines, each of which passes through two known 
points having rational coordinates, 


2. as an intersection of a line that passes through two points having rational 
coordinates and a circle whose center has rational coordinates and whose 
radius is rational. 


3. as an intersection of two circles whose centers have rational coordinates and 
whose radii are rational. 


Equations of lines and circles of the type discussed in 1, 2, and 3 are of the form 
ax +by+c=0 
and 
x+y +tdx+eyt+ f =0, 


where a, b,c, d, e, and f are all in Q. Since in Case 3 the intersection of two circles 
with equations 


ety +daxteyt fi =0 
and 

V+yt+aheteayt f=0 
is the same as the intersection of the first circle having equation 


r+y+dxteyt fi =9, 
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and the line (the common chord) having equation 
(dq, — d&)x + (es ey + fi- fp =9, 


we see that Case 3 can be reduced to Case 2. For Case 1, a simultaneous solution of two 
linear equations with rational coefficients can only lead to rational values of x and y, 
giving us no new points. However, finding a simultaneous solution of a linear equation 
with rational coefficients and a quadratic equation with rational coefficients, as in Case 2, 
leads, upon substitution, to a quadratic equation. Such an equation, when solved by the 
quadratic formula, may have solutions involving square roots of numbers that are not 
squares in Q. 

In the preceding argument, nothing was really used involving Q except field axioms. 
If H is the smallest field containing those real numbers constructed so far, the argument 
shows that the “next new number” constructed lies in a field H(./a) for some w € H, 
where a > 0. We have proved half of our next theorem. 


The field F of constructible real numbers consists precisely of all real numbers that we 
can obtain from Q by taking square roots of positive numbers a finite number of times 
and applying a finite number of field operations. 


We have shown that F can contain no numbers except those we obtain from Q by 
taking a finite number of square roots of positive numbers and applying a finite number 
of field operations. However, if a > 0 is constructible, then Fig. 32.7 shows that fo 
is constructible. Let OA have length a, and find P on OA extended so that OP has 
Igngth 1. Find the midpoint of PA and draw a semicircle with PA as diameter. Erect 
a perpendicular to PA at O, intersecting the semicircle at Q. Then the triangles OPQ 
and OQA are similar, so 


and |O O|? = lw =a. Thus OQ is of length ,/a. Therefore square roots of constructible 
numbers are constructible. 


Theorem 32.1 showed that field operations are possible by construction. Sd 
Q 
P oO A 


32.7 Figure 


32.8 Corollary 


Proof 


32.9 Theorem 


Proof 


32.10 Theorem 


Proof 


32.11 Theorem 


Proof 
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If y is constructible and y ¢ Q, then there is a finite sequence of real numbers 
Q1,°++,@, = y such that O(a, ---, a@;) is an extension of Q(q1, ---, @;_;) of degree 2. 
In particular, [Q(v) : Q] = 2’ for some integer r > 0. 


The existence of the ew; is immediate from Theorem 32.6. Then 


2S [Q(ey, : ++, Op) : Q) 
= [Q@1, +++, &): QAWIINQY) : QI, 


by Theorem 32.4, which completes the proof. ad 


The Impossibility of Certain Constructions 


We can now show the impossibility of certain geometric constructions. 


Doubling the cube is impossible, that is, given a side of a cube, it is not always possible 
to construct with a straightedge and a compass the side of a cube that has double the 
volume of the original cube. 


Let the given cube have a side of length 1, and hence a volume of 1. The cube being 
sought would have to have a volume of 2, and hence a side of length /2. But V2 is a 
zero of irreducible x? — 2 over Q, so 


[Q(W2) : Q} = 3. 


Corollary 32.8 shows that to double this cube of volume 1, we would need to have 3 = 2” 
for séme integer r, but no such r exists. 5 


Squaring the circle is impossible; that is, given a circle, it is not always possible to 
construct with a straightedge and a compass a square having area equal to the area of the 
given circle. 


Let the given circle have a radius of 1, and hence an area of 7. We would need to construct 
a square of side ./7. But mz is transcendental over Q, so /m is transcendental over Q 
also. Sd 


Trisecting the angle is impossible; that is, there exists an angle that cannot be trisected 
with a straightedge and a compass. 


Figure 32.12 indicates that the angle @ can be constructed if and only if a segment of 
length | cos 6| can be constructed. Now 60° is a constructible angle, and we shall show 
that it cannot be trisected. Note that 


cos 30 = cos(20 + @) 
= cos 26 cos @ — sin 26 sin€ 
= (2cos’@ — 1) cos — 2sin@ cos é sind 
= (2cos” 9 — 1)cos@ — 2cos6(1 — cos” 0) 


= 4cos? @ — 3cos 6. 
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Se 
cos @ 
32.12 Figure 


[We realize that many students today have not seen the trigonometric identities we just 
used. Exercise 1 repeats Exercise 40 of Section 1 and asks you to prove the identity 
cos 36 = 4cos? 6 — 3cos@ from Euler’s formula.] 

Let 9 = 20°, so that cos 39 = 4, and let w = cos 20°. From the identity 4 cos’ 6 — 
3cos 6 = cos 36, we see that 


4a? — 3a = z 
2 


Thus @ is a zero of 8x? — 6x — 1. This polynomial is irreducible in Q[x], since, by 
Theorem 23.11, it is enough to show that it does not factor in Z[x]. But a factorization 
in Z[x] would entail a linear factor of the form (8x + 1), (4x + 1), 2x + 1), or & + 1). 
We can quickly check that none of the numbers +4, +i, +3, and +1 is a zero of 


— 


8x? — 6x — 1. Thus 


[(Q@) : Q] =3, 


so by Corollary 32.8, & is not constructible. Hence 60° cannot be trisected. 


HIstTorIcAL NOTE 


G:* mathematicians as far back as the fourth 
century B.C. had tried without success to 
find geometric constructions using straightedge and 
compass to trisect the angle, double the cube, and 
square the circle. Although they were never able to 
prove that such constructions were impossible, they 
did manage to construct the solutions to these prob- 
lems using other tools, including the conic sections. 

It was Carl Gauss in the early nineteenth cen- 
tury who made a detailed study of constructibility 
in connection with his solution of cyclotomic equa- 
tions, the equations of the form x? — 1=0 with 
Pp prime whose roots form the vertices of a regular 
p-gon. He showed that although all such equations 


ate solvable using radicals, if p — 1 is not a power 
of 2, then the solutions must involve roots higher 
than the second. In fact, Gauss asserted that any- 
one who attempted to find a geometric construc- 
tion for a p-gon where p — 1 is not a power of 
2 would “spend his time uselessly.” Interestingly, 
Gauss did not prove the assertion that such con- 
structions were impossible. That was accomplished 
in 1837 by Pierre Wantzel (1814-1848), who in fact 
proved Corollary 32.8 and also demonstrated Theo- 
rems 32.9 and 32.11. The proof of Theorem 32.10, 
on the other hand, requires a proof that z is tran- 
scendental, a result finally achieved in 1882 by 
Ferdinand Lindemann (1852-1939). 
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Note that the regular n-gon is constructible for n > 3 if and only if the angle 27/n 
is constructible, which is the case if and only if a line segment of length cos(27/n) is 


constructible. 


@ EXERCISES 32 


Computations 


1. Prove the trigonometric identity cos 39 = 4cos? 6 — 3cos @ from the Euler formula, e!? = cos@ +i sin@. 


Concepts 
Mark each of the following true or false. 


2. 


Theory 


a. 


Itis impossible to double any cube of con- 
structible edge by compass and straight- 
edge constructions. 


. It is impossible to double every cube 


of constructible edge by compass and 
straightedge constructions. 


. It is impossible to square any circle of 


constructible radius by straightedge and 
compass constructions. 


. Noconstructible angle can be trisected by 


straightedge and compass constructions. 


. Every constructible number is of degree 


2’ over Q for some integer r > 0. 


. We have shown that every real number of 


degree 2” over Q for some integer r > 0 
is constructible. 


. The fact that factorization of a positive 


integer into a product of primes is unique 
(up to order) was used strongly at the con- 
clusion of Theorems 32.9 and 32.11. 


. Counting arguments are exceedingly 


powerful mathematical tools. 


i. We can find any constructible number in 


a finite number of steps by starting with a 
given segment of unit length and using a 
straightedge and a compass. 


j. Wecan find the totality of all constructible 


numbers in a finite number of steps by 
starting with a given segment of unit 
length and using a straightedge and a 
compass. 


3, Using the proof of Theorem 32.11, show that the regular 9-gon is not constructible. 


4, Show algebraically that it is possible to construct an angle of 30°. 


5. Referring to Fig. 32.13, where AQ bisects angle OAP, 
show that the regular 10-gon is constructible (and there- 
fore that the regular pentagon is also). [Hint: Triangle 
OAP is similar to triangle APQ. Show algebraically that 
r is constructible. ] 


1 


32.13 Figure 
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In Exercises 6 through 9 use the results of Exercise 5 where needed to show that the statement is true. 


eo OO “IN 


. The regular 20-gon is constructible. 

. The regular 30-gon is constructible. 

. The angle 72° can be trisected. 

. The regular 15-gon can be constructed. 
10. 


Suppose you wanted to explain roughly in just three or four sentences, for a high school plane geometry teacher 
who never had a course in abstract algebra, how it can be shown that it is impossible to trisect an angle of 60°. 
Write down what you would say. 


FINITE FIELDS 


The purpose of this section is to determine the structure of all finite fields. We shall 
show that for every prime p and positive integer n, there is exactly one finite field (up 
to isomorphism) of order p”. This field GF(p”) is usually referred to as the Galois field 
of order p”. We shall be using quite a bit of our material on cyclic groups. The proofs 
are simple and elegant. 


The Structure of a Finite Field 
We now show that all finite fields must have prime-power order. 


33.1 Theorem Let £ be a finite extension of degree n over a finite field F. If F has g elements, then E 
has g” elements. 


Proof Let {a1,---, @,} bea basis for E as a vector space over F. By Exercise 21 of Section 30, 
every 6 € E can be uniquely written in the form 


B= bya +++ + Dyn 


for b; € F. Since each b; may be any of the g elements of F, the total number of such 
distinct linear combinations of the a; is g”. a 


33.2 Corollary If E is a finite field of characteristic p, then E contains exactly p” elements for some 
positive integer n. 


Proof Every finite field £ is a finite extension of a prime field isomorphic to the field Z,, where 
p is the characteristic of E. The corollary follows at once from Theorem 33.1. ¢ 


We now turn to the study of the multiplicative structure of a finite field. The following 
theorem will show us how any finite field can be formed from the prime subfield. 


33.3 Theorem 


Proof 


33.4 Definition 


33.5 Theorem 


Proof 


33.6 Corollary 


Proof 


33.7 Example 
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Let E be a field of p” elements contained in an algebraic closure Z p Of Z,. The elements 
of E are precisely the zeros in Z, of the polynomial x?" — x in Zp[x]. 


The set £* of nonzero elements of E forms a multiplicative group of order p” — 1 under 
the field multiplication. Fora € E*, the order of a in this group divides the order p” — 1 
of the group. Thus fora € E*, we havea?"~' = 1,soa?" = a. Therefore, every element 
in Eisazero of x?” — x. Since x?" — x canhave at most p” zeros, we see that E contains 
precisely the zeros of x?" — x in Ze a 


An element o of a field is an nth root of unity if «” = 1. It is a primitive nth root of 
unity ifa@” = 1 anda” 41 forO<m <n. | 


Thus the nonzero elements of a finite field of p” elements are all (p” — 1)th roots 
of unity. 

Recall that in Corollary 23.6, we showed that the multiplicative group of nonzero 
elements of a finite field is cyclic. This is a very important fact about finite fields; it has 
actually been applied to algebraic coding. For the sake of completeness in this section, 
we now state it here as a theorem, give a corollary, and illustrate with an example. 


The multiplicative group (F*, -) of nonzero elements of a finite field F is cyclic. 


See Gorollary 23.6. . 4 


A finite extension £ of a finite field F is a simple extension of F. 


Let aw be a generator for the cyclic group E* of nonzero elements of E. Then E = F(a) 
¢ 


Consider the finite field Zj,. By Theorem 33.5 (Z,/*, +) is cyclic. Let us try to find a 
generator of Z, * by brute force and ignorance. We start by trying 2. Since [Z1;*| = 10, 2 
must be an element of Z,,° of order dividing 10, that is, either 2, 5, or 10. Now 


2?=4, 24=47=5, and 2 = (2)5)=10=-1. 


Thus neither 2? nor 2° is 1, but, of course, 2!° = 1, so 2 is a generator of Z1,*, that is, 2 
is a primitive 10th root of unity in Z,,;. We were lucky. 

By the theory of cyclic groups, all the generators of Z,,*, that is, all the primitive 
10th roots of unity in Z;,, are of the form 2”, where n is relatively prime to 10. These 
elements are 
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#@ HistoricaL NOTE 


lthough Carl F. Gauss had shown that the set of 

residues modulo a prime p satisfied the field 
properties, it was Evariste Galois (1811-1832) who 
first dealt with what he called “incommensurable 
solutions” to the congruence F(x) = 0 (mod p), 
where F(x) is an nth degree irreducible polyno- 
mial modulo p. He noted in a paper written in 1830 
that one should consider the roots of this congru- 
ence as “a variety of imaginary symbols” that one 
can use in calculations just as one uses »/—1. Galois 
then showed that if @ is any solution of F(x) = 0 
(mod p), the expression dp + aja + Gnd? ee 
d,-10”~| takes on precisely p” different values. Fi- 
nally, he proved results equivalent to Theorems 33.3 
and 33.5 of the text. 

Galois’ life was brief and tragic. He showed 
brilliance in mathematics early on, publishing 


several papers before he was 20 and essen- 
tially established the basic ideas of Galois theory. 
He was, however, active in French revolutionary 
politics following the July revolution of 1830. In 
May 1831, he was arrested for threatening the life 
of King Louis-Philippe. Though he was acquitted, 
he was rearrested for participating, heavily armed, 
in a republican demonstration on Bastille Day of 
that year. Two months after his release from prison 
the following March, he was killed in a duel, “the 
victim of an infamous coquette and her two dupes”; 
the previous night he had written a letter to a friend 
clarifying some of his work in the theory of equa- 
tions and requesting that it be studied by other math- 
ematicians. Not until 1846, however, were his major 
papers published; it is from that date that his work 
became influential. 


The primitive 5th roots of unity in Z;, are of the form 2”, where the gcd of m and 10 is 


2, that is, 


2? = 4, 


YW—5, 2%=9 2 =3, 


The primitive square root of unity in Zy is 27> = 10 = —1. 


The Existence of GF(p”) 


We turn now to the question of the existence of a finite field of order p” for every prime 
power p’,r > 0. We need the following lemma. 


33.8 Lemma If F is a field of prime characteristic p with algebraic closure F, then x?" — x has p" 


distinct zeros in F. ¢ 
Proof Because F is algebraically closed, x?" — x factors over that field into a product of linear 
factors x — a, so it suffices to show that none of these factors occurs more than once in 
the factorization. 

Since we have not introduced an algebraic theory of derivatives, this elegant tech- 
nique is not available to us, so we proceed by long division. Observe that 0 is a zero of 
x?" — x of multiplicity 1. Suppose a 4 0 is a zero of x?” — x, and hence is a zero of 
f(x) = xP"! — 1. Then x — wis a factor of f(x) in Fx], and by long division, we find 


33.9 Lemma 


Proof 


33.10 Theorem 


Proof 


33.11 Corollary 


Proof 
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that 
f(x) 
(x — a) 


a Te a 2 nn 
PS By PO hg igP 4 es deg Oy gh, 


= g(x) 


Now g(x) has p” — 1 summands, and in g(a), each summand is 


Thus 
1 1 
g(a) = [(p" —1)-1J-=--. 
a a 


since we are in a field of characteristic p. Therefore, g(a) # 0, so @ is a zero of f(x) of 
multiplicity 1. 2 


If F is a field of prime characteristic p, then (w + 8)” =a?" + B?" for alla, B € F 
and all positive integers n. SO 


Let a, 8 € F. Applying the binomial theorem to (a + £)?, we have 


-1 
(a+ py =a" +(p-Darip+ (PPD. 1 )arape 
+---4+(p- Dap?! 4+ BP 
= a? + Ow?! B + Ow?-7p? +--+ OuB?! + BP 
=q? + B?. 
Proceeding by induction on n, suppose that we have (@ + 6)?" =a?" ' + B?"'. Then 
(ee pe Ste + pp 7 Coa + pe ye Si 4+ BP", ° 


A finite field GF(p”) of p” elements exists for every prime power p”. 


Let Zp be an algebraic closure of Z,, and let K be the subset of Z, consisting of all 
zeros of x?” — x in Zp. Let a, B € K. Lemma 33.9 shows that (@ + 8) € K, and the 
equation (@B)?" = a?” 8?" = wB shows thataB € K.Froma?’ = aw weobtain(—a)?" = 
(—1)?" a?" = (—1)?"w. If pis anodd prime, then (—1)?" = —1 andif p = 2then—1 = 1. 
Thus (—@)”" = —a so —a € K. Now Oand 1 are zeros of x?" — x. Fora 40,a@”" =a 
implies that (1/a)?" = 1/a. Thus K is a subfield of Zp containing Z,. Therefore, K is 
the desired field of p” elements, since Lemma 33.8 showed that x?” — x has p” distinct 
zeros in Zp. e 


If F is any finite field, then for every positive integer n, there is an irreducible polynomial 
in F [x] of degree n. 


Let F have q = p’ elements, where p is the characteristic of F. By Theorem 33.10, 
there is afield K < F containing Z, (up to isomorphism) and consisting precisely of the 
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zeros of x?" — x, We want to show F < K. Every element of F is a zero of x?” — x, by 
Theorem 33.3. Now p’’ = p” p”’—), Applying this equation repeatedly to the exponents 
and using the fact that for a €¢ F we have a? = a, we see that fora € F, 


rn 


(a1) 
ae” =a? 


r(n-2) f 


=a? -=a? =a, 


Thus F < K. Then Theorem 33.1 shows that we must have [K : F] =n. We have seen 
that K is simple over F in Corollary 33.6 so K = F(8) for some B € K. Therefore, 
irr(6, F’) must be of degree n. oa 


33.12 Theorem Let p be a prime and let n € Z*. If E and E’ are fields of order p", then E ~ E’. 


Proof Both E and E’ have Z, as prime field, up to isomorphism. By Corollary 33.6, E is a 
simple extension of Z, of degree n, so there exists an irreducible polynomial f(x) of 
degree n in Z,[x] such that E ~ Z,[x]/(f(x)). Because the elements of E are zeros 
of x?" — x, we see that f(x) is a factor of x?’ — x in Z,[x]. Because E’ also consists of 
zeros of x?" — x, we see that FE’ also contains zeros of irreducible f(x) in Z p[x]. Thus, 
because E’ also contains exactly p" elements, E’ is also isomorphic to Z,[x]/(f(x)). 


Sa 


Finite fields have been used in algebraic coding. In an article in the American 
Mathematical Monthly 77 (1970): 249-258, Norman Levinson constructs a linear code 
that can correct up to three errors using a finite field of order 16. 


= EXERCISES 33 


Computations 


In Exercises 1 through 3, determine whether there exists a finite field having the given number of elements. (A 
calculator may be useful.) 


1. 4096 2. 3127 3. 68,921 
4. Find the number of primitive 8th roots of unity in GF(9). 

5. Find the number of primitive 18th roots of unity in GF(19). 

6. Find the number of primitive 15th roots of unity in GF(31). 

7. Find the number of primitive 10th roots of unity in GF(23). 


Concepts 
8. Mark each of the following true or false. 


a. The nonzero elements of every finite field form a cyclic group under multiplication. 
b. The elements of every finite field form a cyclic group under addition. 

c. The zeros in C of (x8 — 1) € Q[x] form a cyclic group under multiplication. 

d. There exists a finite field of 60 elements. 

e. There exists a finite field of 125 elements. 

f. There exists a finite field of 36 elements. 
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g. The complex number i is a primitive 4th root of unity. 


h. There exists an irreducible polynomial of degree 58 in Z2[x]. 


i. The nonzero elements of Q form a cyclic group Q* under field multiplication. 


j. If F is a finite field, then every isomorphism mapping F onto a subfield of an algebraic closure F 
of F is an automorphism of F’. 


Theory 


9. Let Z, be an algebraic closure of Zs, andleta, 6 € Zy be zeros of x? + x? + 1 and of x7 + x + 1, respectively. 
Using the results of this section, show that Zo(@) = Z(B). 


10. Show that every irreducible polynomial in Z,,[x] is a divisor of x?" — x for some n. 


11. Let F be a finite field of p” elements containing the prime subficld Z,. Show that if a € F is a generator of 
the cyclic group (F*, -) of nonzero elements of F, then deg(a, Zp) = n. 


12. Show that a finite field of p* elements has exactly one subfield of p” elements for each divisor m of n. 
13. Show that x2” — x is the product of all monic irreducible polynomials in Z,[x] of a degree d dividing n. 
14. Let p be an odd prime. 


a. Show that for a € Z, where a # 0 (mod p), the congruence x* = a (mod p) has a solution in Z if and only 
if a@-)/?2 = 1 (mod p). [Hint: Formulate an equivalent statement in the finite field Z,, and use the theory 
of cyclic groups.] 

b. Using part (a), determine whether or not the polynomial x? — 6 is irreducible in Z17[x]. 


34.2 Theorem 


34.3 Lemma 
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Section 34 = |somorphism Theorems 

Section 35 Series of Groups 

Section 36 Sylow Theorems 

Section 37 Applications of the Sylow Theory 
Section 38 Free Abelian Groups 

Section 39 Free Groups 

Section 40 Group Presentations 


ISOMORPHISM THEOREMS 


There are several theorems concerning isomorphic factor groups that are known as the 
isomorphism theorems of group theory. The first of these is Theorem 14.11, which we 
restate for easy reference. The theorem is diagrammed in Fig. 34.1. 


6 
G 
9 1G} 
7 7 
be 
Yr wf ‘@ Gsomorphism) 
Pi a 
Pal a 
G/K 
34.1 Figure 


(First Isomorphism Theorem) Let ¢ : G —- G’ be a homomorphism with kernel K, 
andlet yx : G > G/K be the canonical homomorphism. There is a unique isomorphism 
uu: G/K — @[G] such that 6) = w(yvx(~)) for each x € G. 


The lemma that follows will be of great aid in our proof and intuitive understanding 
of the other two isomorphism theorems. 


Let N be a normal subgroup of a group G and let y : G — G/N be the canonical 
homomorphism. Then the map ¢ from the set of normal subgroups of G containing N 
to the set of normal subgroups of G/N given by ¢(L) = y[L] is one to one and onto. 
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Proof 


34.4 Lemma 


Proof 


34.5 Theorem 


Proof 
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Theorem 15.16 shows that if L is a normal subgroup of G containing N, then ¢(L) = 
y[L] is a normal subgroup of G/N. Because N < L, for each x € L the entire coset 
xN in G is contained in L. Thus by Theorem 13.15, y~![(L)] = L. Consequently, if L 
and M are normal subgroups of G, both containing N, and if @(L) = ¢(M) = H, then 
L = y~![H] = M. Therefore ¢ is one to one. 

If H is a normal subgroup of G/N, then y~![H] is a normal subgroup of G by 
Theorem 15.16. Because N € H and y~!{{N}] = N, we see that N C y~'LH]. Then 
é(y! [A] = y[y LH] = H. This shows that ¢ is onto the set of normal subgroups 
of G/N. a 


If H and N are subgroups of a group G, then we let 
HN =({hn|he H,ne€ N}. 


We define the join H V N of H and N as the intersection of all subgroups of G that 
contain HN; thus H v N is the smallest subgroup of G containing HN. Of course 
HV N is also the smallest subgroup of G containing both H and N, since any such 
subgroup must contain HN. In general, H N need not be a subgroup of G. However, we 
have the following lemma. 


If N is anormal subgroup of G, and if H is any subgroup of G, then H VN = HN = 
NH. Farthermore, if H is also normal in G, then HN is normal in G. 


We show that HN is a subgroup of G, from which H v N = HN follows at once. Let 
hy, ho € H and nj,n2 € N. Since N is a normal subgroup, we have nyh2 = han3 for 


“some n3 € N. Then (hyn1)(h2n2) = Ay(nyhz)n2 = hy (h2n3)n2 = (hih2)(n3n2) € HN, 


so HN is closed under the induced operation in G. Clearly e = ee isin HN.Forh ¢ H 
and n € N, we have (hn)! = n~'h-! = h7'nq for some ng € N, since N is a normal 
subgroup. Thus (hn)! € HN, so HN < G. A similar argument shows that NH isa 
subgroup,soNH =HVN=HN. 

Now suppose that H is also normal in G, and let h € H,n € N, and g € G. Then 
ghng—' = (ghg|\(gng—!) € HN, so HN is indeed normal in G. ° 


We are now ready for the second isomorphism theorem. 


(Second Isomorphism Theorem) Let H be a subgroup of G and let N be a normal 
subgroup of G. Then (HN)/N ~ H/(H ON). 


Let y : G — G/N be the canonical homomorphism and let H < G. Then y[H] is a 
subgroup of G/N by Theorem 13.12. Now the action of y on just the elements of H 
(called y restricted to H) provides us with a homomorphism mapping H onto y[H], 
and the kernel of this restriction is clearly the set of elements of N that are also in H, 
that is, the intersection H M N. Theorem 34.2 then shows that there is an isomorphism 
Hi: H/(HON) > y{H]. 

On the other hand, y restricted to HN also provides a homomorphism mapping 
AN onto y[H], because y(n) is the identity N of G/N for all n € N. The kernel 
of y restricted to HN is N. Theorem 34.2 then provides us with an isomorphism 
Ha: (HN)/N > y[4]. 


34.6 Example 


34.7 Theorem 


Proof 
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Because (HN)/N and H/(H M N) are both isomorphic to y[H], they are isomor- 
phic to each other. Indeed, 6 : (HN)/N — H/(H ON) where ¢ = by 2 will be an 
isomorphism. More explicitly, 


b((hn)N) = by | (u2((hn)N)) = wy '(h) = (A NN). + 


Lett G=ZxZxZ,H=ZxZ™x {0}, and N = {0} x Zx Z. Then clearly HN = 
ZxZxZ and HMN = {0} x Zx {0}. We have (HN)/N =~ Z and we also have 
A/ANN)~Z. A 


If H and K are two normal subgroups of G and K < H, then H/K is a normal 
subgroup of G/K. The third isomorphism theorem concerns these groups. 


(Third Isomorphism Theorem) Let H and K be normal subgroups of a group G with 
K < H. Then G/H ~ (G/K)/(H/K). 


Let 6: G > (G/K)/(H/K) be given by ¢(a) = (aK )(H/K) for a € G. Clearly ¢ is 
onto (G/K)/(H/K), and for a, b € G, 


p(ab) = [(ab)K\(H/K) = [(@K)OK)\H/K) 
= (4K )(H/K)][(6K )(H/K)] 
= $@)o(>), 


so @ is a homomorphism. The kernel consists of those x € G such that @(~) = H/K. 
These x are just the elements of H. Then Theorem 34.2 shows that G/H ~ 
(G/K)/(H/K). ° 


Anice way of viewing Theorem 34.7 is to regard the canonical map yy : G > G/H 
as being factored via a normal subgroup K of G, K < H <G, to give 


YH = VH/KYK: 


up to a natural isomorphism, as illustrated in Fig. 34.8. Another way of visualizing this 
theorem is to use the subgroup diagram in Fig. 34.9, where each group is a normal 
subgroup of G and is contained in the one above it. The larger the normal subgroup, the 
smaller the factor group. Thus we can think of G collapsed by H, that is, G/H, as being 
smaller than G collapsed by K. Theorem 34.7 states that we can collapse G all the way 
down to G/#H in two steps. First, collapse to G/K, and then, using H/K, collapse this 
to (G/K)/(H/K). The overall result is the same (up to isomorphism) as collapsing G 
by HZ. 


Yer 
G > G/H 
Natural isomorphism H 
Vx 
G/K——————. (G, H, 
/K——> (GIR) (HIB) 2 


34.8 Figure 34.9 Figure 
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34.10 Example Consider K =6Z<H=2Z<G=Z. Then G/H=Z/2Z2~Z,. Now G/K 
= Z,/6Z has elements 


6Z, 1+ 62, 2+ 62, 3+ 62, 4+ 62Z, and 5 + 62. 


Of these six cosets, 6Z, 2 + 6Z, and 4 + 6Z lie in 2Z/6Z. Thus (Z/6Z)/(2Z/6Z) has 
two elements and is isomorphic to Z, also. Alternatively, we see that Z/6Z ~ Ze, and 
2Z/6Z corresponds under this isomorphism to the cyclic subgroup (2) of Ze. Thus 
(Z/6Z)/(2Z/6Z) ~ Ze / (2) ~ Z. ~ Z/2Z. A 


@ EXERCISES 34 


Computations 


In using the three isomorphism theorems, it is often necessary to know the actual correspondence given by the 
isomorphism and not just the fact that the groups are isomorphic. The first six exercises give us training for this. 


1. 


» 


6. 


Let @ : Z12 — Z3 be the homomorphism such that ¢(1) = 2. 

a. Find the kernel K of ¢. 

b. List the cosets in Z1)/K, showing the elements in each coset. 

c. Give the correspondence between Z,./K and Z3 given by the map yw described in Theorem 34.2. 
Let @ : Zig — Zy be the homomorphism where @(1) = 10. 

a. Find the kemel K of ¢. 

b. List the cosets in Z1g/K, showing the elements in each coset. 

c. Find the group ¢[Z1s]. 

d, Give the correspondence between Z)3/K and ¢[Zys] given by the map yu described in Theorem 34.2. 
In the group Zo4, let H = (4) and N = (6). 

a, List the elements in HN (which we might write H + N for these additive groups) and in HNN. 
b. List the cosets in HN/N, showing the elements in each coset. 

c. List the cosets in H/(H MN), showing the elements in each coset. 

d. Give the correspondence between HN/N and H/(H M N) described in the proof of Theorem 34.5. 


. Repeat Exercise 3 for the group Z35 with H = (6) and N = (9). 


In the group G = Zy4, let H = (4) and K = (8). 

a. List the cosets in G/H, showing the elements in each coset. 

b. List the cosets in G/K, showing the elements in each coset. 

c. List the cosets in H/K, showing the elements in each coset. 

d. List the cosets in (G/K)/(H/K), showing the elements in each coset. 

e. Give the correspondence between G/H and (G/K)/(H/K) described in the proof of Theorem 34.7. 


Repeat Exercise 5 for the group G = Z36 with H = (9) and K = (18). 


Theory 


7. 


Show directly from the definition of a normal subgroup that if H and N are subgroups of a group G, and N is 
normal in G, then H MN is normal in H. 


Section 35 Series of Groups 311 


8. Let H, K, and L be normal subgroups of G with H < K < L.LetA=G/H, B = K/H, andC =L/H. 


a. Show that B and C are normal subgroups of A, and B < C. 
b. To what factor group of G is (A/B)/(C/B) isomorphic? 


9. Let K and L be normal subgroups of G with K v L = G,and K NL = {e}. Showthat G/K ~ LandG/L~ K. 


35.1 Definition 


Se, 


35.2 Example 


35.3 Example 


35.4 Definition 


35.5 Example 


SERIES OF GROUPS 


Subnormal and Normal Series 


This section is concerned with the notion of a series of a group G, which gives insight 
into the structure of G. The results hold for both abelian and nonabelian groups. They 
are not too important for finitely generated abelian groups because of our strong Theo- 
rem 11.12. Many of our illustrations will be taken from abelian groups, however, for 
ease of computation. 


Asubnormal (or subinvariant) series of a group G isa finite sequence Hp, Hi, ---, Hy, 
of subgroups of G such that H; < Hj, and H; is anormal subgroup of Hj. with Ho = 
{e} and H, = G. Anormal (or invariant) series of G is a finite sequence Ho, H,,---, H, 
of normal subgroups of G such that H; < Hjs1, Hp = {e}, and H, = G. | 


Note that for abelian groups the notions of subnormal and normal series coincide, 
since every subgroup is normal. A normal series is always subnormal, but the converse 
need not be true. We defined a subnormal series before a normal series, since the concept 
of a’subnormal series is more important for our work. 


Two examples of normal series of Z under addition are 
{0} <8Z2 <4Z<Z 
and 


{0} <9Z% <Z. 


Consider the group D4 of symmetries of the square in Example 8.10. The series 


{00} < {e0, i} < {0, 02, 1, M2} < Dg 


is a subnormal series, as we could check using Table 8.12. It is not a normal series since 
{o, 1} is not normal in Dg. A 


A subnormal (normal) series {X;} is a refinement of a subnormal (normal) series {H; } 
of a group G if {H;} ¢ {K;}, that is, if each H; is one of the Kj. | 


The series 


{0} < 72Z < 24Z <8Z <4Z <Z 
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35.6 Definition 


35.7 Example 


35.8 Example 


Advanced Group Theory 


is a refinement of the series 
{0} < 72Z < 8Z < Z. 
Two new terms, 4Z and 24Z, have been inserted. A 
Of interest in studying the structure of G are the factor groups H;4,/H;. These are 


defined for both normal and subnormal series, since H; is normal in H;, in either case. 


Two subnormal (normal) series {H;} and {K;} of the same group G are isomorphic if 
there is a one-to-one correspondence between the collections of factor groups {Hj+1/H;} 
and {K ;4;/K;} such that corresponding factor groups are isomorphic. 


Clearly, two isomorphic subnormal (normal) series must have the same number of 
groups. 
The two series of Zs, 
| {0} < (5) <Zis 
and 
{0} < (3) < Zis, 


are isomorphic. Both Z;5 / (5) and (3) /{0} are isomorphic to Zs, and Zs / (3) is isomorphic 
to (5)/{0}, or to Z3. A 


The Schreier Theorem 
We proceed to prove that two subnormal series of a group G have isomorphic refinements. 
This is a fundamental result in the theory of series. The proof is not too difficult. However, 
we know from experience that some students get lost in the proof, and then tend to feel 
that they cannot understand the theorem. We now give an illustration of the theorem 
before we proceed to its proof. 
Let us try to find isomorphic refinements of the series 
{0} <8Z <4Z <Z 
and 
{0} <9Z <Z 

given in Example 35.2. Consider the refinement 

{0} < 72Z<8Z<4Z<Z 
of {0} < 8Z < 4Z < Zand the refinement 

{0} < 72Z <18Z2 <9Z <Z 
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of {0} < 9Z < Z. In both cases the refinements have four factor groups isomorphic to 
Za, £2, Zo, and 72Z or Z. The order in which the factor groups occur is different to be 
sure. A 


We start with a rather technical lemma developed by Zassenhaus. This lemma is 
sometimes called the butterfly lemma, since Fig. 35.9, which accompanies the lemma, 
has a butterfly shape. 

Let H and K be subgroups of a group G, and let H* be a normal subgroup of H 
and K* be a normal subgroup of K. Applying the first statement in Lemma 34.4 to H* 
and HM K as subgroups of H, we see that H*(H  K) is a group. Similar arguments 
show that H*(H 9 K*), K*(H 0 K), and K*(H* MK) are also groups. It is not hard 
to show that H* MK is a normal subgroup of H NK (see Exercise 22). The same 
argument using Lemma 34.4 applied to H* 1 K and HM K* as subgroups of HM K 
shows that L = (H* N K)(H M K*) isa group. Thus we have the diagram of subgroups 
shown in Fig. 35.9. It is not hard to verify the inclusion relations indicated by the 
diagram. 

Since both HM K* and H*(K are normal subgroups of HM K, the second 
statement in Lemma 34.4 shows that L = (H* K)(H NM K*) is a normal subgroup 
of HO K. We have denoted this particular normal subgroup relationship by the heavy 
middle line in Fig. 35.9. We claim the other two heavy lines also indicate normal sub- 
group relationships, and that the three factor groups given by the three normal sub- 
group relations are all isomorphic. To show this, we shall define a homomorphism 
@:H*(HNK)—> (A K)/L, and show that ¢ is onto (HM K)/L with kernel 
H*(H 2 K*). It will then follow at once from Theorem 34.2 that H*(H M K*) is normal 


H K 


H'(HNK) K*(HNK) 


HOK* 


35.9 Figure 
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in H*(H 1 K), and that A*(H 1 K)/A*(H 1 K*) ~ (HA K)/L. A similar result for 
the groups on the right-hand heavy line in Fig. 35.9 then follows by symmetry. 

Let 6: H*(H™K) > (H 1 K)/L be defined as follows. For h ¢ H* and x € 
AK, let o(hx) = xL. We show ¢ is well-defined and a homomorphism. Let h,, hz € 
H* and x1,x. € HOK. If hix, =hoxo, then hy 'hy = xox! € H*N(HNK)= 
A* OK CL,sox,L = xoL. Thus ¢ is well defined. Since H* is normal in H, there is 
h3 in H* such that x;h2 = h3x,. Then 

H(A, x1)(A2x2)) = O((21h3) 1x2) = (4p%Q)L 
= Oi L)@rL) = 1x1) » O(h2x2). 
Thus ¢ is a homomorphism. 

Obviously ¢ is onto (HN K)/L. Finally ifh € H* andx € HM K, then (Hx) = 
xL = L if and only if x € L, or if and only ifhx € H*L= A*(A*NK\VANK*)= 
H*(H (1 K*). Thus Ker(¢) = H*(H 1 K*). 

We have proved the following lemma. 


(Zassenhaus Lemma) Let H and K be subgroups of a group G and let H* and K* be 
normal subgroups of H and K, respectively. Then 
1. H*(H 1 K*)is anormal subgroup of H*(# 1 K). 
2. K*(H* 1 K)is anormal subgroup of K*(H K). 
3. A*(HOK)/A*(ANK*)~ KAN K)/K*(A*NK) 
~ (HN K)/((A* 9 K)AN K*)). 


(Schreier Theorem) Two subnormal (normal) series of a group G have isomorphic 
refinements. 


Let G be a group and let 

fes= Ap < Ay <b <+<H,=G (1) 
and 

{fe} = Ky < Ki < Kp <-s) < Ky =G (2) 


be two subnormal series for G. For i where 0 <i <n — 1, form the chain of groups 
Ay = Ay (Hig 0 Ko) S Ai Ain1 1K) S++) Ss ACA O Km) = His. 


This inserts m — 1 not necessarily distinct groups between H; and Hj+,. If we do this 
for each i where 0 <i <n —1 and let H,; = H;(Hj41 9 K;), then we obtain the chain 
of groups 


{e} = Hoo < Ho < Ho <-+* < Hom-1 < Hio 
< hs < M2 <-+++ < Mimi < Ayo 


< Any < Any <-+- < Ao m-1 < Hs 


lA 


< Ani < An-12 < +++ XS An-iym—1 < An-1,m 


=G. (3) 
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This chain (3) contains nm + 1 not necessarily distinct groups, and Aig = H; for eachi. 
By the Zassenhaus lemma, chain (3) is a subnormal chain, that is, each group is normal 
in the following group. This chain refines the series (1). 

In a symmetric fashion, we set Kj; = K;(Kj+1.H;) forO <j <m—Jland0< 
i <n. This gives a subnormal chain 


{e} = Koo < Ko. < Ko2 <--+ < Kon-1 < Kio 


< Ki < Ki2 <--> < Kin-1 < K20 


< Ka, < Koo < +++ S Kon-1 S K30 


lA 


< Km-1,1 S Km-12 < +++ < Km-1n-1 < Km-1n 


=G. (4) 


This chain (4) contains mn + 1 not necessarily distinct groups, and Kj = K; for each 
j. This chain refines the series (2). 
By the Zassenhaus lemma 35.10, we have 


Ay Ai 1 0 Kj21)/ Ai A410 Ky) ~ Kj (Kya 9 Wi41)/ Kj (By n &), 


or 
Fi j41/Hiy © Kyi / Kj: (5) 


for @ <i <n-—1 and 0 < j <m-—1. The isomorphisms of relation (5) give a one- 
to-one correspondence of isomorphic factor groups between the subnormal chains (3) 
and (4). To verify this correspondence, note that Hjo = H; and Him = Hj+:, while 
Kjo = Kj and K;,, = Kj+1. Each chain in (3) and (4) contains a rectangular array of 
mn symbols <. Each < gives rise to a factor group. The factor groups arising from the 
rth row of <’s in chain (3) correspond to the factor groups arising from the rth column 
of <’s in chain (4). Deleting repeated groups from the chains in (3) and (4), we obtain 
subnormal series of distinct groups that are isomorphic refinements of chains (1) and 
(2). This establishes the theorem for subnormal series. 

For normal series, where all H; and K; are normal in G, we merely observe that 
all the groups H;,; and K,; formed above are also normal in G, so the same proof 
applies. This normality of H;,; and K;; follows at once from the second assertion in 
Lemma 34.4 and from the fact that intersections of normal subgroups of a group yield 
normal subgroups. Ad 


The Jordan-Hélder Theorem 
We now come to the real meat of the theory. 
A subnormal series {H;} of a group G is a composition series if all the factor groups 


Hj4,/H; are simple. A normal series {H;} of G is a principal or chief series if all the 
factor groups H;.,/H; are simple. | 
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Note that for abelian groups the concepts of composition and principal series coin- 
cide. Also, since every normal series is subnormal, every principal series is a composition 
series for any group, abelian or not. 


We claim that Z has no composition (and also no principal) series. For if 
{0} = Ho < My <-++ < Ay} <H,=Z 


is a subnormal series, H, must be of the form rZ for some r € Z*. But then H;/Hp is 
isomorphic to rZ, which is infinite cyclic with many nontrivial proper normal subgroups, 
for example, 2rZ. Thus Z has no composition (and also no principal) series. A 


The series 
{e} < An < S, 


for n > 5 is a composition series (and also a principal series) of S,, because A;,/{e} is 
isomorphic to A,, which is simple for n > 5, and S,/A, is isomorphic to Zy, which is 
simple. Likewise, the two series given in Example 35.7 are composition series (and also 
principal series) of Zs. They are isomorphic, as shown in that example. This illustrates 
our main theorem, which will be stated shortly. A 


Observe that by Theorem 15.18, Hj41/H; is simple if and only if 4; is a maximal 
normal subgroup of H;41. Thus for a composition series, each H; must be a maximal 
normal subgroup of H;11. To form a composition series of a group G, we just hunt for 
a maximal normal subgroup H,-, of G, then for a maximal normal subgroup Hn-2 
of H,—-1, and so on. If this process terminates in a finite number of steps, we have a 
composition series. Note that by Theorem 15.18, a composition series cannot have any 
further refinement. Jo form a principal series, we have to hunt for a maximal normal 
subgroup Hy, of G, then for a maximal normal subgroup Hy—2 of Hp—, that is also 
normal in G, and so on. The main theorem is as follows. 


(Jordan—Hélder Theorem) Any two composition (principal) series of a group G are 
isomorphic. 


Let {H;} and {K;} be two composition (principal) series of G. By Theorem 35.11, 
they have isomorphic refinements. But since all factor groups are already simple, Theo- 
rem 15.18 shows that neither series has any further refinement. Thus {#7;} and {K;} must 
already be isomorphic. ¢ 


For a finite group, we should regard a composition series as a type of factorization 
of the group into simple factor groups, analogous to the factorization of a positive 
integer into primes. In both cases, the factorization is unique, up to the order of the 
factors. 


@ HistoricaL NOTE 


his first appearance of what became the Jordan— 
Hélder theorem occurred in 1869 in a commen- 
tary on the work of Galois by the brilliant French al- 
gebraist Camille Jordan (1838-1922). The context 
of its appearance is the study of permutation groups 
associated with the roots of polynomial equations. 


Jordan asserted that even though the sequence of 
normal subgroups G, J, J,---of the group of the 
equation is not necessarily unique, nevertheless 
the sequence of indices of this composition series 
is unique. Jordan gave a proof in his monumen- 
tal 1870 Treatise on Substitutions and Algebraic 
Equations. This latter work, though restricted to 
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what we now call permutation groups, remained 
the standard treatise on group theory for many 
years. 

The Holder part of the theorem, that the se- 
quence of factor groups in a composition series 
is unique up to order, was due to Otto Holder 
(1859-1937), who played a very important role in 
the development of group theory once the com- 
pletely abstract definition of a group had been given. 
Among his other contributions, he gave the first 
abstract definition of a “factor group” and deter- 
mined the structure of all finite groups of square-free 
order. 
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35.16 Theorem If G has a composition (principal) series, and if N is a proper normal subgroup of G, 
then there exists a composition (principal) series containing N. 


Proof The series 


fe} <N<G 


is both a subnormal and a normal series. Since G has a composition series {H;}, then by 
Theorem 35.11 there is a refinement of {e} < N < G to a subnormal series isomorphic 
to arefinement of {H;}. But as a composition series, {H; } can have no further refinement. 
Thus {e} < N < G can be refined to a subnormal series all of whose factor groups are 
simple, that is, to a composition series. A similar argument holds if we start with a 
principal series {K;} of G. ¢ 


35.17 Example A composition (and also a principal) series of Z4 x Zo containing ((0, 1)) is 


{(0, 0)} < (0, 3)) < (@, 1)) < (2) x (1) < (1) x (1) = Zy x Zo. A 


The next definition is basic to the characterization of those polynomial equations 
whose solutions can be expressed in terms of radicals. 


35.18 Definition A group G is solvable if it has a composition series {H;} such that all factor groups 


Aj41/H; are abelian. a 


By the Jordan—Hoélder theorem, we see that for a solvable group, every composition 
series {H;} must have abelian factor groups Hj+,/H;. 
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The group $3 is solvable, because the composition series 
fe} < A3 < S3 


has factor groups isomorphic to Z, and Z, which are abelian. The group Ss is not 
solvable, for since As is simple, the series 


fel < As < Ss 


is acomposition series, and A5/{e}, which is isomorphic to As, is not abelian. This group 
As of order 60 can be shown to be the smallest group that is not solvable. This fact is 
closely connected with the fact that a polynomial equation of degree 5 is not in general 
solvable by radicals, but a polynomial equation of degree < 4 is. A 


The Ascending Central Series 


We mention one subnormal series for a group G that can be formed using centers of 
groups. Recall from Section 15 that the center Z(G) of a group G is defined by 


Z(G) = {ze G|zg = gz forall g € GI, 


and that Z(G) is a normal subgroup of G. If we have the table for a finite group G, it is 
easy to find the center. An element a will be in the center of G if and only if the elements 
in the row opposite a at the extreme left are given in the same order as the elements in 
the column under a at the very top of the table. 

. Now let G be a group, and let Z(G) be the center of G. Since Z(G) is normal in 
G, we can form the factor group G/Z(G) and find the center Z(G/Z(G)) of this factor 
group. Since Z(G/Z(G)) is normal in G/Z(G), if y : G > G/Z(G) is the canonical 
map, then by Theorem 15.16, y~![Z(G/Z(G))] is a normal subgroup Z(G) of G. We 
can then form the factor group G/Z,(G) and find its center, take (y; 7! of it to get Z2(G), 
and so on. 


The series 
{e} < Z(G) < Z(G) < Z2,(G) <--: 


described in the preceding discussion is the ascending central series of the group G. 
| 


The center of $3 is just the identity {9}. Thus the ascending central series of S3 is 
{po} < {oo} < {oo} S °°. 


The center of the group D4 of symmetries of the square in Example 8.10 is {/9, 2}. 
(Do you remember that we said that this group would give us nice examples of many 
things we discussed?) Since D4/{0, 02} is of order 4 and hence abelian, its center is all 
of D4/{o, 02}. Thus the ascending central series of Dy, is 


{00} < {00,2} < Da < Da < Dg S++. a 
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Computations 
In Exercises 1 through 5, give isomorphic refinements of the two series. 
1. {0} < 10Z < Zand {0} < 25Z <Z 
. {0} < 60Z < 20Z < Zand {0} < 245Z < 49% < Z 
« {0} < (3) < Zoq and {0} < (8) < Zoq 
~ {0} < (18) < (3) < Zp and {0} < (24) < (12) < Zp 
. {(0, 0)} < (60Z) x Z < (10Z) x Z < Z x Zand {(0, 0)} < Z x (80Z) < Z x 20Z)<Z2xZ 
Find all composition series of Zo and show that they are isomorphic. 
. Find all composition series of Z4g and show that they are isomorphic. 


. Find all composition series of Zs x Zs. 


econ nvnan dB ww 


. Find all composition series of 53 x Zp. 


= 
—) 


. Find all composition series of Z) x Zs x Zy. 
. Find the center of $3 x Z4. 


—— 
Ne 


. Find the center of $3 x Da. 


— 
Go 


. Find the ascending central series of $3 x Zy. 
14. Find the ascending central series of 53 x Da. 
Concepts 


In Exercises 15 and 16, cérrect the definition of the italicized term without reference to the text, if correction is 
needed, so that it is in a form acceptable for publication. 


15. A composition series of a group G is a finite sequence 
{fe}= Hy < H, < Hp <--- <1 <H,=G 


of subgroups of G such that H; is a maximal normal subgroup of H;.; fori =0,1,2,---,n—1. 
16. A solvable group is one that has a composition series of abelian groups. 
17. Mark each of the following true or false. 


a. Every normal series is also subnormai. 
b. Every subnormal series is also normal. 
c. Every principal series is a composition series. 


d. Every composition series is a principal series. 


e. Every abelian group has exactly one composition series. 


f. Every finite group has a composition series. 


g. A group is solvable if and only if it has a composition series with simple factor groups. 

h. 57 is a solvable group. 

i. The Jordan—Hélder theorem has some similarity with the Fundamental Theorem of Arithmetic, 
which states that every positive integer greater than 1 can be factored into a product of primes 
uniquely up to order. 


j- Every finite group of prime order is solvable. 
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18. 
19, 
20. 


21. 
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Find a composition series of 53 x $3. Is $3 x $3 solvable? 
Is the group D, of symmetries of the square in Example 8.10 solvable? 
Let G be Z36. Refer to the proof of Theorem 35.11. Let the subnormal series (1) be 


{0} < (12) < (3) < Za6 
and let the subnormal series (2) be 
{0} < (18) < Z36- 


Find chains (3) and (4) and exhibit the isomorphic factor groups as described in the proof. Write chains (3) and 
(4) in the rectangular array shown in the text. 


Repeat Exercise 20 for the group Z4 with the subnormal series (1) 
{0} < (12) < (4) < Zo, 
and (2) 
{0} < (6) < (3) < Zoq. 


Theory 


22. 
23 


24, 


25. 
26. 


27. 


Let H*, H, and K be subgroups of G with H* normal in H. Show that H* 9 K is normalin HN K. 
Show that if 


Ho ={e} < Hi <A) <:+-<H,=G 
is a subnormal (normal) series for a group G, and if H;.,/H; is of finite order s;,1, then G is of finite order 
8182+ Sp. 


Show that an infinite abelian group can have no composition series. [Hint: Use Exercise 23, together with the 
fact that an infinite abelian group always has a proper normal subgroup.] 


Show that a finite direct product of solvable groups is solvable. 


Show that a subgroup K of a solvable group G is solvable. [Hint: Let Hp = {e} < Hi <---< H,= Gbea 
composition series for G. Show that the distinct groups among K  H; fori = 0,---, form a composition 
series for K. Observe that 


(K 0 Aj)/(K 0 Hi-1) = [Ai (K 9 A/C A-1], 


by Theorem 34.5, with He KOH; and N = H;,_,, and that H;_1(K 1 H;) < 4;.] 


Let Hp = {e} < H, <--- < H, = G be a composition series for a group G. Let N be a normal subgroup of 
G, and suppose that N is a simple group. Show that the distinct groups among Ho, H;N fori = 0,---,n also 
form a composition series for G. [Hint: H;N is a group by Lemma 34.4. Show that H;_1N is normal in H;N. 
By Theorem 34.5 


(H)N)/(Hi-1N) = Hi /{H; 0 (Hi-1N)), 
and the latter group is isomorphic to 
(Hi /Ay-11/(i 0 (Ai-1.N))/ Hi-, 
by Theorem 34.7. But H;/H;-1 is simple.] 


28. 


29 
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Let G be a group, and let Hy) = {e} < H; <--- < H, = G be a composition series for G. Let N be a normal 
subgroup of G, and let y : G ~ G/N be the canonical map. Show that the distinct groups among y[H;] for 
i =0,---,n, form a composition series for G/N. [Hint: Observe that the map 


defined by 


vy: AN > y(A))/ylHi-1] 


whin) = yhin)y[Ai-1] 


is ahomomorphism with kernel H;_;N. By Theorem 34.2. 


¥(Hi)/y|Hi-1) = N)/(i-1N). 


Proceed via Theorem 34.5, as shown in the hint for Exercise 27.] 


Prove that a homomorphic image of a solvable group is solvable. [Hint: Apply Exercise 28 to get a composition 
series for the homomorphic image. The hints for Exercises 27 and 28 then show how the factor groups of this 
composition series in the image look.] 


SyLow THEOREMS 


The fundamental theorem for finitely generated abelian groups (Theorem 11.12) gives 
us complete information about all finite abelian groups. The study of finite nonabelian 
groups is much more complicated. The Sylow theorems give us some important infor- 
mation about them. 

‘We know the order of a subgroup of a finite group G must divide |G]. If G is abelian, 
then there exist subgroups of every order dividing |G|. We showed in Example 15.6 that 
Ag, which has order 12, has no subgroup of order 6. Thus anonabelian group G may have 
no subgroup of some order d dividing |G|; the “converse of the theorem of Lagrange” 
does not hold. The Sylow theorems give a weak converse. Namely, they show that if d 
is a power of a prime and d divides |G|, then G does contain a subgroup of order d. 
(Note that 6 is not a power of a prime.) The Sylow theorems also give some information 
concerning the number of such subgroups and their relationship to each other. We will 
see that these theorems are very useful in studying finite nonabelian groups. 

Proofs of the Sylow theorems give us another application of action of a group on a 
set described in Section 16. This time, the set itself is formed from the group; in some 
instances the set is the group itself, sometimes it is a collection of cosets of a subgroup, 
and sometimes it is a collection of subgroups. 


p-Groups 


Section 17 gave applications of Burnside’s formula that counted the number of orbits in 
a finite G-set. Most of our results in this section flow from an equation that counts the 
number of elements in a finite G-set. 

Let X be a finite G-set. Recall that for x € X, the orbit of x in X under G is 
Gx = {gx |g € G}. Suppose that there are r orbits in X under G, and let {x1, x2, ---, x,} 
contain one element from each orbit in X. Now every element of X is in precisely one 
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orbit, so 


r 


IX| = )— [Gx;|. (1) 


i=1 


There may be one-element orbits in X. Let Xg = {x € X | gx =x forall g € G}. Thus 
Xgq is precisely the union of the one-element orbits in X. Let us suppose there are s 
one-element orbits, where 0 < s < r. Then |Xqg| = s, and reordering the x; if necessary, 
we may rewrite Eq. (1) as 


IX =|Xel + > [Gxil. (2) 
i=s41 


Most of the results of this section will flow from Eg. (2). We shall develop 
Sylow theory as in Hungerford [10], where credit is given to R. J. Nunke for the line 
of proof. The proof of Theorem 36.3 (Cauchy’s theorem) is credited there to J. H. 
McKay. 

Theorem 36.1, which follows, is not quite a counting theorem, but it does have a 
numerical conclusion. It counts modulo p. The theorem seems to be amazingly powerful. 
In the rest of the chapter, if we choose the correct set, the correct group action on it, and 
apply Theorem 36.1, what we want seems to fall right into our lap! Compared with older 
proofs, the arguments are extremely pretty and elegant. 

Throughout this section, p will always be a prime integer. 


«Let G be a group of order p” and let X be a finite G-set. Then |X| = |Xg| (mod p). 


In the notation of Eq. (2), we know that |Gx;| divides [G| by Theorem 16.16. Conse- 
quently p divides |Gx;| for s+ 1 <i <r. Equation (2) then shows that |X| — |X@| is 
divisible by p, so |X| = |Xg| (mod p). ¢ 


Let p be a prime. A group G is a p-group if every element in G has order a power of 
the prime p. A subgroup of a group G is a p-subgroup of G if the subgroup is itself a 
p-group. | 


Our goal in this section is to show that a finite group G has a subgroup of every 
prime-power order dividing |G]. As a first step, we prove Cauchy’s theorem, which says 
that if p divides |G|, then G has a subgroup of order p. 


(Cauchy’s Theorem) Let p be a prime. Let G be a finite group and let p divide |G. 
Then G has an element of order p and, consequently, a subgroup of order p. 


We form the set X of all p-tuples (g1, g2,---, gp) of elements of G having the property 
that the product of the coordinates in G is e. That is, 


X = {(g1, 82,°°+, Sp) |ai € G and g192°+- gp =e}. 


36.4 Corollary 


Proof 


36.5 Definition 
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We claim p divides |X|. In forming a p-tuple in X, we may let g1, g2,-+-, 8p_1 be any 
elements of G, and g, is then uniquely determined as (g1g2---gp-1)”'. Thus |X| = 
|G|?—! and since p divides |G|, we see that p divides |X|. 

Let o be the cycle (1,2, 3,---, p) in S,. We let o act on X by 


O (815 825 °**s Bp) = (Bat1)s Ba(2)s***s Solp)) = (82, 835° °+s Sp» B1)- 


Note that (g2, g3,---, 8p, 81) € X, for gi(g2g83--- 8p) =e implies that g; = (g2g3--- 
8p) |, 80 (8283 °** &p)81 = e also. Thus o acts on X, and we consider the subgroup (o) 
of S$, to act on X by iteration in the natural way. 

Now |(c)| = p, so we may apply Theorem 36.1, and we know that |X| = |X,o}| 
(mod p). Since p divides |X|, it must be that p divides | X,,)| also. Let us examine Xj). 
Now (81, 82,°+*» Sp) is left fixed by o, and hence by (c), ifand only if g1 = 92. = ++. = 
gp. We know at least one element in X(,), namely (e, e, ---, €). Since p divides |X(o)|, 
there must be at least p elements in X,,). Hence there exists some elementa € G,a #e, 
such that (a,a,-+-+,a@) € X(,, and hence a? = e, so a has order p. Of course, (a) is a 
subgroup of G of order p. ¢ 


Let G be a finite group. Then G is a p-group if and only if |G| is a power of p. 
We leave the proof of this corollary to Exercise 14. ¢ 


The Sylow Theorems 


Let G be a group, and let . be the collection of all subgroups of G. We make . into 
a G-set by letting G act on Y by conjugation. That is, if H ¢ “so H < Gandg eG, 
then g acting on H yields the conjugate subgroup gH g~. (To avoid confusion, we will 
never write this action as gH.) Now Gy = {g €¢ G| gHg7! = H} is easily seen to be a 
subgroup of G (Exercise 11), and H is a normal subgroup of Gy. Since Gy consists of 
all elements of G that leave H invariant under conjugation, Gg is the largest subgroup 
of G having H as a normal subgroup. 


The subgroup G y just discussed is the normalizer of H in G and will be denoted N[H] 
from now on. a 


In the proof of the lemma that follows, we will use the fact that if H is a finite 
subgroup of a group G, then g € N[H]if ghg™! € H forallh € H.Tosec this, note that 
if ghjg~! = ghog™', then h, = hy by cancellation in the group G. Thus the conjugation 
map ig : H — H given by ig(h) = ghg™' is one to one. Because |H| is finite, i, must 
then map H onto H, so gHg~' = H and g € N[H]. 


Let H be a p-subgroup of a finite group G. Then 


(N[H]: H) =(G: H)imod p). 


LLL E__SL!L!|Lmlmmmmmm 


324 Part VII Advanced Group Theory 


@ HistoricaL NoTE 


be Sylow theorems are due to the Norwegian 
mathematician Peter Ludvig Mejdell Sylow 
(1832-1918), who published them in a brief pa- 
per in 1872. Sylow stated the theorems in terms of 
permutation groups (since the abstract definition of 
a group had not yet been given). Georg Frobenius 
re-proved the theorems for abstract groups in 1887, 
even though he noted that in fact every group can be 
considered as a permutation group (Cayley’s theo- 
em [Theorem 8.16]). Sylow himself immediately 


Proof Let #be the set of left cosets of H in G, and let H act on Ydiy left translation, so that 


A 


applied the theorems to the question of solving al- 
gebraic equations and showed that any equation 
whose Galois group has order a power of a prime p 
is solvable by radicals. 

Sylow spent most of his professional life as a 
high school teacher in Halden, Norway, and was 
only appointed to a position at Christiana Univer- 
sity in 1898. He devoted eight years of his life to 
the project of editing the mathematical works of his 
countryman Niels Henrik Abel. 


Mla a I 


h(xH) = (hx)H. Then F vecomes an H-set. Note that |\Z%\=(G: A). 


Let us determine %,, that is, 
ments of H.NowxH = A(x H)ifand only if H = x 
ThusxH = A(xH)forallh € H ifandonly ifx ‘hx = xa e Hforallh € A, 
or if and only if x7! € N[H] (see the comment before the lemma), or if and only if 


x € N[H]. Thus the left cosets in Ay are those contained in N[H]. The number of such 


cosets is (N[H1: H), so |\Ay| = (NLAI: A). 


« Since H is a p-group, it has order a power of p by Corollary 36.4. Theorem 36.1 
then tells us that |4] = | 4z| (mod p), 


that is, that (G : H) =(N[H1]: H) (mod p). 
¢ 


those left cosets that are fixed under action by all ele- 
-1pnxH orifand only ifx~'hx € H. 


36.7 Corollary Let H bea p-subgroup of a finite group G. If p divides (G : H), then N[H] #4 H. 
Proof It follows from Lemma 36.6 that p divides (N[H1: A), which must then be differe 
from 1. Thus H # N{H]. 
We are now ready for the first of the Sylow theorems, which asserts the existence 
of prime-power subgroups of G for any prime power dividing |Gj. 
36.8 Theorem (First Sylow Theorem) Let G be a finite group and let |G| = p”m where n > | and 
where p does not divide m. Then 
1. G contains a subgroup of order p’ for each i where 1 < i<n, 
2. Every subgroup H of G of order p! is a normal subgroup of a subgroup of 
order pit! for 1 <i <n. 
Proof 1. We know G contains a subgroup of order p by Cauchy’s theorem 


(Theorem 36.3). We use an induction argument and show that the existence of 
a subgroup of order p! fori <n implies the existence of a subgroup of order 
p't!. Let H be a subgroup of order p'. Since i < n, we see p divides (G : H). 
By Lemma 36.6, we then know p divides (V [H]: H). Since H isa normal 


36.9 Definition 


36.10 Theorem 


Proof 


36.11 Theorem 


Proof 
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subgroup of N[H], we can form N[H]/H, and we see that p divides 
|[N[H]/H|. By Cauchy’s theorem, the factor group N[H]/H has a subgroup 
K which is of order p. If y : NLH] — N[H]/A is the canonical 
homomorphism, then y~![K] = {x € N[H]|y(x) € K} is a subgroup of 
N[H] and hence of G. This subgroup contains H and is of order p't!. 

2. We repeat the construction in part 1 and note that H < y—'[K] < NIH] 
where |y'[K]| = p'*!. Since A is normal in N[H], it is of course normal 
in the possibly smaller group y~![K]. 5 


A Sylow p-subgroup P ofa group G is a maximal p-subgroup of G, thatis, a p-subgroup 
contained in no larger p-subgroup. a 


Let G be a finite group, where |G| = p”m as in Theorem 36.8. The theorem shows 
that the Sylow p-subgroups of G are precisely those subgroups of order p”. If P is 
a Sylow p-subgroup, every conjugate gPg—' of P is also a Sylow p-subgroup. The 
second Sylow theorem states that every Sylow p-subgroup can be obtained from P in 
this fashion; that is, any two Sylow p-subgroups are conjugate. 


(Second Sylow Theorem) Let P; and P, be Sylow p-subgroups of a finite group G. 
Then P, and P, are conjugate subgroups of G. 


Here we will let one of the subgroups act on left cosets of the other, and use Theorem 36.1. 
Let Abe the collection of left cosets of P,, and let P) acton Hby yx Pi) = (yx)P; for 
y € Py). Then. His a Py-set. By Theorem 36.1, | Ap, | = |-4| (mod p), and |Z = (G: P;) 
is not divisible by p, so |Ap,| #0. Let xP; € Bp. Then yx P,; = xP, for all y € Po, 
so x! yx P, = P; for all y € Py. Thus x~!yx € P; for all y € Ph, so. x! Pax < Py. 
Since | P;| = | P2|, we must have P; = x7! P)x, so P; and P3 are indeed conjugate sub- 
groups. ¢ 


The final Sylow theorem gives information on the number of Sylow p-subgroups. A 
few illustrations are given after the theorem, and many more are given in the next section. 


(Third Sylow Theorem) If G is a finite group and p divides |G|, then the number of 
Sylow p-subgroups is congruent to 1 modulo p and divides |G}. 


Let P be one Sylow p-subgroup of G. Let. “be the set of all Sylow p-subgroups and let 
P act on.“ by conjugation, so that x € P carries T ¢ Winto xT x~!. By Theorem 36.1, 
|.Y| = |.A%| (mod p). Let us find Y. lf T ¢ .A, then xTx~' = T for all x € P. Thus 
P <N[T]. Of course T < N[T] also. Since P and T are both Sylow p-subgroups of 
G, they are also Sylow p-subgroups of N[T]. But then they are conjugate in N[T] by 
Theorem 36.10. Since T is a normal subgroup of N[T], it is its only conjugate in N[T]. 
Thus T = P. Then .% = {P}. Since |.~| = |.%| = 1 (mod p), we see the number of 
Sylow p-subgroups is congruent to 1 modulo p. 

Now let G act on .Wby conjugation. Since all Sylow p-subgroups are conjugate, 
there is only one orbit in. “under G. If P € then |.~| = lorbit of P| = (G : Gp) by 
Theorem 16.16. (Gp is, in fact, the normalizer of P.) But (G : Gp) is a divisor of |G|, 
so the number of Sylow p-subgroups divides |G|. ad 
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36.12 Example The Sylow 2-subgroups of 5S; have order 2. The subgroups of order 2 in S3 in Example 8.7 
are 


(Po, M1}, {Po, #2}, {00, 43}. 


Note that there are three subgroups and that 3 = 1 (mod 2). Also, 3 divides 6, the order 
of $3. We can readily check that 


ip l{Po, Hit] = {0, us} and ip, Heo, Hi}] = (00, 2} 


where i,,(x) = p oes illustrating that they are all conjugate. A 


36.13 Example Let us use the Sylow theorems to show that no group of order 15 is simple. Let G have 
order 15. We claim that G has a normal subgroup of order 5. By Theorem 36.8 G has at 
least one subgroup of order 5, and by Theorem 36.11 the number of such subgroups is 
congruent to 1 modulo 5 and divides 15. Since 1, 6, and 11 are the only positive numbers 
less than 15 that are congruent to 1 modulo 5, and since among these only the number 1 
divides 15, we see that G has exactly one subgroup P of order 5. But for each g € G, the 
inner automorphism i, of G with i,(x) = gxg—! maps P onto a subgroup gPg7', again 
of order 5. Hence we must have gPg! = P for all g € G, so P is a normal subgroup 
of G. Therefore, G is not simple. (Example 37.10 will show that G must actually be 
abelian and therefore must be cyclic.) A 


We trust that Example 36.13 gives some inkling of the power of Theorem 36.11. 
Never underestimate a theorem that counts something, even modulo p. 


m EXERCISES 36 


Computations 
In Exercises 1 through 4, fill in the blanks. 
1. A Sylow 3-subgroup of a group of order 12 has order 


2. A Sylow 3-subgroup of a group of order 54 has order 


3. A group of order 24 must have either 
in Theorem 36.11.) 


4. A group of order 255 = (3)(5)(17) must have either or Sylow 3-subgroups and 
Sylow 5-subgroups. (Use only the information given in Theorem 36.11.) 


or ______ Sylow 2-subgroups. (Use only the information given 


or 


5. Find all Sylow 3-subgroups of S4 and demonstrate that they are all conjugate. 


6. Find two Sylow 2-subgroups of $4 and show that they are conjugate. 


Concepts 


In Exercises 7 through 9, correct the definition of the italicized term without reference to the text, if correction is 
needed, so that it is in a form acceptable for publication. 


7. Let p be a prime. A p-group is a group with the property that every element has order p. 


8. The normalizer N[H] of a subgroup A of a group G is the set of all inner automorphisms that carry H onto 
itself. 


10. 
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. Let G bea group whose order is divisible by a prime p. The Sylow p-subgroup of a group is the largest subgroup 


P of G with the property that P has some power of p as its order. 


Mark each of the following true or false. 


a. Any two Sylow p-subgroups of a finite group are conjugate. 
b. Theorem 36.11 shows that a group of order 15 has only one Sylow 5-subgroup. 


c. Every Sylow p-subgroup of a finite group has order a power of p. 

d. Every p-subgroup of every finite group is a Sylow p-subgroup. 

e. Every finite abelian group has exactly one Sylow p-subgroup for each prime p dividing the order 
of G. 

_____f. The normalizer in G of a subgroup H of G is always a normal subgroup of G. 

g. If H is a subgroup of G, then H is always a normal subgroup of N[A]. 


_____h. A Sylow p-subgroup of a finite group G is normal in G if and only ifit is the only Sylow p-subgroup 
of G. 


____i. If G is an abelian group and H is a subgroup of G, then N[H] = H. 
j. A group of prime-power order p” has no Sylow p-subgroup. 


Theory 


11. 
12. 


13. 
14. 
15. 


16. 


17. 
18. 
19. 
20. 


21. 


22. 


Let H be a subgroup of a group G. Show that Gy = {g € G| gHg7! = H} is a subgroup of G. 


Let G be a finite group and let primes p and g # p divide |G]. Prove that if G has precisely one proper Sylow 
p-subgroup, it is a normal subgroup, so G is not simple. 


Show that every group of order 45 has a normal subgroup of order 9. 
Prove Corollary 36.4. 


Let G be a finite group and let p be a prime dividing |G]. Let P be a Sylow p-subgroup of G. Show that 
N[N[P]] = N[P]. [Hint: Argue that P is the only Sylow p-subgroup of N[N[P]], and use Theorem 36.10.] 


Let G be a finite group and let a prime p divide |G|. Let P be a Sylow p-subgroup of G and let H be any 
p-subgroup of G. Show there exists g € G such that gHg™! < P. 


Show that every group of order (35)’ has a normal subgroup of order 125. 

Show that there are no simple groups of order 255 = (3)(5)(17). 

Show that there are no simple groups of order p”m, where p is a prime, r is a positive integer, and m < p. 
Let G be a finite group. Regard G as a G-set where G acts on itself by conjugation. 

a. Show that Gg is the center Z(G) of G. (See Section 15.) 

b. Use Theorem 36.1 to show that the center of a finite nontrivial p-group is nontrivial. 


Let p be a prime. Show that a finite group of order p” contains normal subgroups H; for 0 <i <n such that 
|H;| = p’ and H; < H;,, forO <i <n. [Hint: See Exercise 20 and get an idea from Section 35.] 


Let G be a finite group and let P be a normal p-subgroup of G. Show that P is contained in every Sylow 
p-subgroup of G. 


APPLICATIONS OF THE SYLOW THEORY 


In this section we give several applications of the Sylow theorems. It is intriguing to see 
how easily certain facts about groups of particular orders can be deduced. However, we 
should realize that we are working only with groups of finite order and really making 
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37.1 Theorem 


Proof 


37.2 Definition 


37.3 Example 
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only a small dent in the general problem of determining the structure of all finite groups. 
If the order of a group has only a few factors, then the techniques illustrated in this section 
may be of some use in determining the structure of the group. This will be demonstrated 
further in Section 40, where we shall show how it is sometimes possible to describe all 
groups (up to isomorphism) of certain orders, even when some of the groups are not 
abelian. However, if the order of a finite group is highly composite, that is, has a large 
number of factors, the problem is in general much harder. 


Applications to p-Groups and the Class Equation 


Every group of prime-power order (that is, every finite p-group) is solvable. 


If G has order p’, it is immediate from Theorem 36.8 that G has a subgroup 4; of order 
p’ normal in a subgroup Aj, of order p't! for 1 <i <r. Then 


fel = Hy < A, < Ho <:+:'<H,=G 


is a composition series, where the factor groups are of order p, and hence abelian and 
actually cyclic. Thus, G is solvable. ¢ 


The older proofs of the Sylow theorems used the class equation. The line of proof in 
Section 36 avoided explicit mention of the class equation, although Eq. (2) there is a 
general form of it. We now develop the classic class equation so you will be familiar 
with it. a 

Let X be a finite G-set where G is a finite group. Then Eq. (2) of Section 36 tells 
us that 


IX] =|Xel+ >> [Gail (1) 
i=s+1 


where x; is an element in the ith orbit in X. Consider now the special case of Eq. (1), 
where X = G and the action of G on G is by conjugation, so g € G carries x € X —G 
into gxg~!. Then 


Xg = {x € G|gxg™! 


= {x € G|xg = ex forall g € G} = Z(G), 


= x forall g € G} 


the center of G. If we let c = |Z(G)| and a; = |Gx;| in Eq. (1), then we obtain 
IG) =C+Meg1 Hee thy (2) 


where 7; is the number of elements in the ith orbit of G under conjugation by itself. 
Note that n; divides |G| fore +1 <i <r since in Eq. (1) we know |Gx;| = (G : G,,), 
which is a divisor of |G]. 


Equation (2) is the class equation of G. Each orbit in G under conjugation by G isa 
conjugate class in G. | 


It is readily checked that for $3 of Example 8.7, the conjugate classes are 


{eo}, {e1, p2}, {i1, 2, 3}. 


37.4 Theorem 


Proof 


37.5 Lemma 


Proof 


37.6 Theorem 


Proof 
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The class equation of $3 is 
6=142+3. A 


For illustration of the use of the class equation, we prove a theorem that Exercise 
20(b) in Section 36 asked us to prove. 


The center of a finite nontrivial p-group G is nontrivial. 


In Eq. (2) for G, each n; divides |G| forc + 1 <i <r, so p divides each n;, and p 
divides |G|. Therefore p divides c. Now e € Z(G), soc > 1. Therefore c > p, and there 
exists some a € Z(G) where a Fe. ¢ 


We turn now to a lemma on direct products that will be used in some of the theorems 
that follow. 


Let G be a group containing normal subgroups H and K such that HM K = {e} and 
H V K =G. Then G is isomorphic to H x K. 


We start by showing that hk = kh for k € K and h € H. Consider the commutator 
hkh—-'k7 = (hkh7")k7! = h(kh—'k7!). Since H and K are normal subgroups of G, 
the two groupings with parentheses show that hkh-'k—! is in both K and H. Since 
K OH = {e}, we see that hkh~'k! = e, so hk = kh. 

Let@: H x K — G be defined by $(A, k) = hk. Then 


o((h, kh’, kK’) = OAK’, kk’) = hh kk’ 
= hkh'k' = oth, Doh’, k’), 


so @ is a homomorphism. 

If @(h, k) =e, then hk = e, so h = k7", and both h and k are in HM K. Thus 
h =k =e, so Ker(d) = {(e, e)} and ¢ is one to one. 

By Lemma 34.4, we know that HK = Hv K, and H V K =G by hypothesis. 
Thus ¢ is onto G, and H x K ~G. Sf 


For a prime number p, every group G of order p? is abelian. 


If G is not cyclic, then every element except e must be of order p. Let a be such an 
element. Then the cyclic subgroup (a) of order p does not exhaust G. Also let b € G 
with b ¢ (a). Then (a) N (b) = {e}, since an element c in (a) M (b) with c # e would 
generate both (a) and (b), giving (a) = (b), contrary to construction. From Theorem 
36.8, (a) is normal in some subgroup of order p* of G, that is, normal in all of G. 
Likewise (b}) is normal in G. Now (a) Vv (b) is a subgroup of G properly containing 
(a) and of order dividing p?. Hence (a) Vv (b) must be all of G. Thus the hypotheses of 
Lemma 37.5 are satisfied, and G is isomorphic to (a) x (b) and therefore abelian. 


Further Applications 


We turn now to a discussion of whether there exist simple groups of certain orders. We 
have seen that every group of prime order is simple. We also asserted that A, is simple 
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for n > 5 and that As is the smallest simple group that is not of prime order. There was 
a famous conjecture of Burnside that every finite simple group of nonprime order must 
be of even order. It was a triumph when this was proved by Thompson and Feit [21]. 


If p and g are distinct primes with p < q, then every group G of order pg has a single 
subgroup of order g and this subgroup is normal in G. Hence G is not simple. If g is not 
congruent to 1 modulo p, then G is abelian and cyclic. 


Theorems 36.8 and 36.11 tell us that G has a Sylow qg-subgroup and that the number 
of such subgroups is congruent to 1 modulo g and divides pq, and therefore must 
divide p. Since p < q, the only possibility is the number 1. Thus there is only one Sylow 
q-subgroup Q of G. This group Q must be normal in G, for under an inner automorphism 
it would be carried into a group of the same order, hence itself. Thus G is not simple. 
Likewise, there is a Sylow p-subgroup P of G, and the number of these divides pg 
and is congruent to 1 modulo p. This number must be either 1 or qg. If g is not congruent 
to 1 modulo p, then the number must be 1 and P is normal in G. Let us assume that 
q # 1 (mod p). Since every element in Q other than e is of order g and every element in 
P other than e is of order p, we have QN P = {e}. Also Q Vv P must be a subgroup of 
G properly containing Q and of order dividing pg. Hence OQ v P = G and by Lemma 
37.5 is isomorphic to Q x P or Zy x Zp. Thus G is abelian and cyclic. Ad 


We need another lemma for some of the counting arguments that follow. 


If H and K are finite subgroups of a group G, then 


A\\(\K 
\HK| = (Ad I) 
IHN K| 
Recall that 7K = {hk |h ¢ H,k € K}. Let |H| =r, |K|=s, and|HMK|=t. Now 
HK has atmostrs elements. However, itis possible for hk, to equal tok2, forh), hy € H 
and k,, ky € K; that is, there may be some collapsing. If A,k, = hk, then let 


x = (ho) thy = ko(ky) 


Now x =(h2)'h, shows that x © H, and x = ko(k,;)"' shows that x € K. Hence 
x €(HN K), and 


hy = hy x! and = ky = xky. 


On the other hand, if for y € (HM K) we let h3 = hyy~! and k3 = yk,, then clearly 
h3k3 = h,k,, withh3 € H andk3 € K.Thuseach element hk € HK can be represented 
in the form h;k;, forh; € H andk; € K, as many times as there are elements of HN K, 
that is, t times. Therefore, the number of elements in HK is rs/t. 5 


Lemma 37.8 is another result that counts something, so do not underestimate it. The 
lemma will be used in the following way: A finite group G cannot have subgroups H 
and K that are too large with intersections that are too small, or the order of HK would 
have to exceed the order of G, which is impossible. For example, a group of order 24 
cannot have two subgroups of orders 12 and 8 with an intersection of order 2. 


37.9 Example 


37.10 Example 


37.11 Example 


37.12 Example 


37,13 Example 


37.14 Example 
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The remainder of this section consists of several examples illustrating techniques of 
proving that all groups of certain orders are abelian or that they have nontrivial proper 
normal subgroups, that is, that they are not simple. We will use one fact we mentioned 
before only in the exercises. A subgroup H of index 2 in a finite group G is always 
normal, for by counting, we see that there are only the left cosets H itself and the coset 
consisting of all elements in G not in H. The right cosets are the same. Thus every right 
coset is a left coset, and H is normal in G. 


No group of order p” forr > 1 is simple, where p is a prime. For by Theorem 36.8 such 
a group G contains a subgroup of order p’—! normal in a subgroup of order p”, which 
must be all of G. Thus a group of order 16 is not simple; it has a normal subgroup of 
order 8. A 


Every group of order 15 is cyclic (hence abelian and not simple, since 15 is not a prime). 
This is because 15 = (5)(3), and 5 is not congruent to 1 modulo 3. By Theorem 37.7 we 
are done. A 


No group of order 20 is simple, for sucha group G contains Sylow 5-subgroups in number 
congruent to 1 modulo 5 and a divisor of 20, hence only 1. This Sylow 5-subgroup is 
then normal, since all conjugates of it must be itself. A 


No group of order 30 is simple. We have seen that if there is only one Sylow p-subgroup 
for some prime p dividing 30, we are done. By Theorem 36.11 the possibilities for the 
number of Sylow 5-subgroups are | or 6, and those for Sylow 3-subgroups are 1 or 10. 
But if G has six Sylow 5-subgroups, then the intersection of any two is a subgroup of 
each of order dividing 5, and hence just {e}. Thus each contains 4 elements of order 5 
that are in none of the others. Hence G must contain 24 elements of order 5. Similarly, 
if G has 10 Sylow 3-subgroups, it has at least 20 elements of order 3. The two types 
of Sylow subgroups together would require at least 44 elements in G. Thus there is a 
normal subgroup either of order 5 or of order 3. A 


No group of order 48 is simple. Indeed, we shall show that a group G of order 48 has 
a normal subgroup of either order 16 or order 8. By Theorem 36.11 G has either one 
or three Sylow 2-subgroups of order 16. If there is only one subgroup of order 16, it is 
normal in G, by a now familiar argument. 

Suppose that there are three subgroups of order 16, and let H and K be two of them. 
Then H % K must be of order 8, for if HM K were of order < 4, then by Lemma 37.8 
HK would have at least (16)(16)/4 = 64 elements, contradicting the fact that G has 
only 48 elements. Therefore, H M K is normal in both H and K (being of index 2, or 
by Theorem 36.8), Hence the normalizer of HM K contains both H and K and must 
have order a multiple >1 of 16 and a divisor of 48, therefore 48. Thus H  K must be 
normal in G. A 


No group of order 36 is simple. Such a group G has either one or four subgroups of order 
9. If there is only one such subgroup, it is normal in G. If there are four such subgroups, 
let H and K be two of them. As in Example 37.13, H M K must have at least 3 elements, 
or HK would have to have 81 elements, which is impossible. Thus the normalizer of 
HK has as order a multiple of >1 of 9 and a divisor of 36; hence the order must 
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be either 18 or 36. If the order is 18, the normalizer is then of index 2 and therefore is 
normal in G. If the order is 36, then H M K is normal in G. A 


Every group of order 255 = (3)(5)(17) is abelian (hence cyclic by the Fundamental 
Theorem 11.12 and not simple, since 255 is not a prime). By Theorem 36.11 such a 
group G has only one subgroup H of order 17. Then G/H has order 15 and is abelian 
by Example 37.10. By Theorem 15.20, we see that the commutator subgroup C of G is 
contained in H. Thus as a subgroup of H, C has either order 1 or 17. Theorem 36.11 
also shows that G has either 1 or 85 subgroups of order 3 and either 1 or 51 subgroups of 
order 5. However, 85 subgroups of order 3 would require 170 elements of order 3, and 51 
subgroups of order 5 would require 204 elements of order 5 in G; both together would 
then require 375 elements in G, which is impossible. Hence there is a subgroup K having 
either order 3 or order 5 and normal in G. Then G/K has either order (5)(17) or order 
(3)(17), and in either case Theorem 37.7 shows that G/K is abelian. Thus C < K and 
has order either 3, 5, or 1. Since C < H showed that C has order 17 or 1, we conclude 
that C has order 1. Hence C = {e}, and G/C ~ Gis abelian. The Fundamental Theorem 
11.12 then shows that G is cyclic. A 
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Computations 


1. Let Dg be the group of symmetries of the square in Example 8.10. 


a. Find the decomposition of Ds into conjugate classes. 
b. Write the class equation for D4. 


. By arguments similar to those used in the examples of this section, convince yourself that every nontrivial 
group of order not a prime and less than 60 contains a nontrivial proper normal subgroup and hence is not 
simple. You need not write out the details. (The hardest cases were discussed in the examples.) 


Concepts 


3. 


oan mf 


Mark each of the following true or false. 


. Every group of order 159 is cyclic. 

. Every group of order 102 has a nontrivial proper normal subgroup. 

. Every solvable group is of prime-power order. 

. Every group of prime-power order is solvable. 

. It would become quite tedious to show that no group of nonprime order between 60 and 168 is 


simple by the methods illustrated in the text. 


f. No group of order 21 is simple. 

g. Every group of 125 elements has at least 5 elements that commute with every element in the group. 
h. Every group of order 42 has a normal subgroup of order 7. 

i. Every group of order 42 has a normal subgroup of order 8. 

j. The only simple groups are the groups Z, and A, where p is a prime andn #4. 
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Theory 
4. Prove that every group of order (5)(7)(47) is abelian and cyclic. 


5. Prove that no group of order 96 is simple. 
6. Prove that no group of order 160 is simple. 


7. Show that every group of order 30 contains a subgroup of order 15. [Hint: Use the last sentence in Example 
37.12, and go to the factor group.] 


8. This exercise determines the conjugate classes of S,, for every integer n > 1. 


a. Show thatifo = (a), a, ---, @m)isacyclein S, and t is any element of S,, then tot !=(tay,Tay,---, TA). 

b. Argue from (a) that any two cycles in S, of the same length are conjugate. 

¢e. Argue from (a) and (b) that a product of s disjoint cycles in S, of lengths r; fori = 1,2, ---, s is conjugate 
to every other product of s disjoint cycles of lengths 7; in S,. 

d. Show that the number of conjugate classes in S, is p(n), where p(n) is the number of ways, neglecting 
the order of the summands, that 2 can be expressed as a sum of positive integers. The number p(7) is the 
number of partitions of 7. 

e. Compute p(n) for n = 1, 2,3, 4,5, 6, 7. 

9. Find the conjugate classes and the class equation for S4. [Hint: Use Exercise 8.] _.— 
10. Find the class equation for Ss and S¢. [Hint: Use Exercise 8.] 


11. Show that the number of conjugate classes in 5, is also the number of different abelian groups (up to isomor- 
phism) of order p”, where p is a prime number. [Hint: Use Exercise 8.] 


12. Show that if n > 2, the center of 5S, is the subgroup consisting of the identity permutation only. [Hint: Use 
Exercise 8.] 


FREE ABELIAN GROUPS 


In this section we introduce the concept of free abelian groups and prove some re- 
sults concerning them. The section concludes with a demonstration of the Fundamental 
Theorem of finitely generated abelian groups (Theorem 11.12). 


Free Abelian Groups 


We should review the notions of a generating set for a group G and a finitely generated 
group, as given in Section 7. In this section we shall deal exclusively with abelian groups 
and use additive notations as follows: 
O for the identity, + for the operation, 
nad=a+at-::+4 
—$—$—S ee’ 
n summands 
na = (—a) + (-a) +--- +(-a) 
erent, 
n summands 
Oa = 0 for the first 0 in Z and the second in G. 


forn € Zt anda eG. 


We shall continue to use the symbol x for direct product of groups rather than change 
to direct sum notation. 


334 


Part VII 


38.1 Theorem 


Proof 


38.2 Definition 


38.3 Example 


Advanced Group Theory 


Notice that {(1, 0), (0, 1)} is a generating set for the group Z x Z since 
(n,m) = n(1, 0) + m(0, 1) for any (n, m) in Z x Z. This generating set has the property 
that each element of Z x Z can be uniquely expressed in the form n(1, 0) + m(O, 1). 
That is, the coefficients n and m in Z are unique. 


Let X be a subset of a nonzero abelian group G. The following conditions on X are 
equivalent. 


1. Each nonzero element a in G can be expressed uniquely (up to order of 
summands) in the form a = myx, + n2X2-+ +++ + Mpxy for n; #4 0 in Z and 
distinct x; in X. 

2. X generates G, and nx; + moX2 +++ PAX, = 0 forn; € Z and distinct 
x; € X if and only ifn) =n. =--- =n, = 0. 


Suppose Condition 1 is true. Since G # {0}, we have X # {0}. It follows from 1 thatO ¢ 
X, forifx; = Oandx,; 4 0, then x; = xj + xj, which would contradict the uniqueness of 
the expression for x;. From Condition 1, X generates G,and nj, x1] + Nex. +--+ + NypXy = 
0 if ny =n =+++ =n, =O. Suppose that myx, + noxX2 Te Try = O with some 
n; # 0; by dropping terms with zero coefficients and renumbering, we can assume all 
nj; #0. Then 


xy = Xp + (yxy + gxg +++ + NpXy) 
= (ny + Dxy +ngx2 Fo Xe, 


which gives two ways of writing x; + 0, contradicting the uniqueness assumption in 


‘Condition 1. Thus Condition 1 implies Condition 2. 


We now show that Condition 2 implies Condition 1. Let a € G. Since X generates 
G, we see a can be written in the form a = myx, + 12%. +++) + MX. Suppose a has 
another such expression in terms of elements of X. By using some zero coefficients in 
the two expressions, we can assume they involve the same elements in X and are of the 
form 


A = Nyx + NAA To Np Xy 
a= myx, +i2X2 +++ M,X,. 


Subtracting, we obtain 


0 = (ny — my )xy + (12 — m2)x2 +--+ + (Hy — mM, Xr 


so n; —m; = 0 by Condition 2, and n; = m; fori =1,2,---,r. Thus the coefficients 
are unique. Sd 


An abelian group having a generating set X satisfying the conditions described in 
Theorem 38.1 is a free abelian group, and X is a basis for the group. a 


The group Z x Z is free abelian and {(1, 0), (O, 1)} is a basis. Similarly, a basis for the 
free abelian group Z x Z x Zis {(1, 0, 0), (0, 1, 0), (0, 0, 1)}, and so on. Thus finite 
direct products of the group Z with itself are free abelian groups. A 


38.4 Example 


38.5 Theorem 


38.6 Theorem 


Proof 
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The group Z,, is not free abelian, for nx = 0 for every x € Z,, andn 4 0, which would 
contradict Condition 2. rN 


Suppose a free abelian group G has a finite basis X = {x1, x2,---,x,}. Ifa eG 
and a + 0, then a has a unique expression of the form 


Q= Nyx, +ngX.+-+-+n,-x, for n; € Z. 


(Note that in the preceding expression for a, we included all elements x; of our finite basis 
X, as opposed to the expression for a in Condition 1 of Theorem 38.1 where the basis 
may be infinite. Thus in the preceding expression for a we must allow the possibility 
that some of the coefficients n; are zero, whereas in Condition 1 of Theorem 38.1, we 
specified that each n; 4 0.) 

We define 


:GSZxZx::-x@ 
a 
r factors 


by O(a) = (11. N22, +++, ny) and 6(0) = (0, 0, ---, 0). It is straightforward to check that 
¢ is an isomorphism. We leave the details to the exercises (see Exercise 9) and state the 
result as a theorem. 


If G is a nonzero free abelian group with a basis of r elements, then G is isomorphic to 
Zx@Zx---x Z@ for r factors. 


dt is a fact that any two bases of a free abelian group G contain the same number of 
elements. We shall prove this only if G has a finite basis, although it is also true if every 
basis of G is infinite. The proof is really lovely; it gives an easy characterization of the 
number of elements in a basis in terms of the size of a factor group. 


Let G F {0} be a free abelian group with a finite basis. Then every basis of G is finite, 
and all bases of G have the same number of elements. 


Let G have a basis {x1, x2,---,x,}. Then G is isomorphic to Zx Zx.---x Zforr 
factors. Let 2G = {2¢ |g € G}. Itis readily checked that 2G is a subgroup of G. Since 
G2rZxZx:--xZforr factors, we have 


G/2G ~ (ZX Bx + x ZOD x 2W x--» x 2Z) 


~ 2. xX xX+:-xh 


for r factors. Thus |G/2G| = 2’, so the number of elements in any finite basis X is 
log. |G/2G|. Thus any two finite bases have the same number of elements. 

It remains to show that G cannot also have an infinite basis. Let Y be any basis for G, 
and let {yi, y2,---, ys} be distinct elements in Y. Let H be the subgroup of G generated 
by {y1, y2,°°-+, Ys}, and let K be the subgroup of G generated by the remaining elements 
of Y. It is readily checked that G ~ H x K,so 


G/2G ~ (H x K)/QH x 2K) ~ (H/2H) x (K/2K). 
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Since |H/2H]| = 2°, we see |G/2G| > 2°. Since we have |G/2G| = 2’, we see that 
5s <r. Then Y cannot be an infinite set, for we could take s > r. A 


If G is a free abelian group, the rank of G is the number of elements in a basis for G. 
(All bases have the same number of elements.) | 


Proof of the Fundamental Theorem 


We shall prove the Fundamental Theorem (Theorem 11.12) by showing that any finitely 
generated abelian group is isomorphic to a factor group of the form 


(ZX Zx-+ x Z)f(dyZ x dL x--. x dZ x {0} x--- x {0}, 


where both “numerator” and “denominator” have n factors, and d, divides d2, which 
divides d; ---, which divides d,. The prime-power decomposition of Theorem 11.12 will 
then follow. ; 

To show that G is isomorphic to such a factor group, we will show that there is a 
homomorphism of Z x Z x --- x Zonto G with kernel of the form d|Z x d2Z x --- x 
d,Z x {0} x --- x {0}. The result will then follow by Theorem 14.11. The theorems that 
follow give the details of the argument. Our purpose in these introductory paragraphs is 
to let us see where we are going as we read what follows. 


Let G be a finitely generated abelian group with generating set {@1, a2, +++, dy}. Let 
@o:Z2xZx---xZ>G 
n factors 


be defined by $(h1, ha, +--+, hy) = hiay + hgag +--+ + haan. Then ¢ is a homomor- 
phism onto G. 


From the meaning of h;a; for hj € Z and a; € G, we see at once that 
ol, -++ An) + (Ri, +++ Kn) = o(hi +k, +++ Ay + kn) 
= (hy +ky)ay +++ + (ty t+ kn) an 
= (hyay + kay) + +++ + AnGn + KaGn) 
= (hia, tess +hndn) + (kya, oe + kn Qn) 
= oki, no kn) + (i, irae: hy). 
Since {a, +--+, a} generates G, clearly the homomorphism ¢ is onto G. a 
We now prove a “replacement property” that makes it possible for us to adjust a 
basis. 
If X = {x1,--+, x,} is a basis for a free abelian group G and ¢t ¢€ Z, then fori ¥ j, the 
set 
Y= {x1, vty X71, 4%; ate tXj, Xj, Xj4i, Sa Xr} 


is also a basis for G. 


Proof 


38.10 Example 


38.11 Theorem 


Proof 
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Since x; = (—t)x; + (1)\(x; + ¢x;), we see that x; can be recovered from Y. which thus 
also generates G. Suppose 


miXpteee byipxjia ty ey +x) tyaxygi te b,x, = 0. 


Then 

Nyx bee +n tanya fees tnyxy tess tnpx, = 0. 
and since X is a basis, nm) =--- =n, +njt=---=nj =--- =n, =0. Fromn; =0 
andi; +n,t = 0,itfollowsthatn; = Oalso,son, =--- =n =--- = Apes SA, = 
0, and Condition 2 of Theorem 38.1 is satisfied. Thus Y is a basis. ¢ 


A basis for Z x Z is {(1, 0), (0, 1)}. Another basis is {(1, 0), (4, 1} for @, 1) = 
4(1, 0) + (0, 1). However, {(3, 0), (0, 1)} is not a basis. For example, we cannot express 
(2, 0) in the form n,(3, 0) +.72(0, 1), forn, m2 € Z. Here (3, 0) = (1, 0) + 2(1, 0), and 
a multiple of a basis element was added to itse/f, rather than to a different basis element. 

A 


A free abelian group G of finite rank may have many bases. We show thatif K < G, 
then K is also free abelian with rank not exceeding that of G. Equally important, there 
exist bases of G and K nicely related to each other. 


Let G be anonzero free abelian group of finite rank n, andlet K be a nonzero subgroup of 


G. Then K is free abelian of rank s < n. Furthermore, there exists a basis {x;, x2, +++, Xn} 
forG and positive integers, dj, d2,---,d, where d; divides dj,; fori = 1,---,s —1, 
such that {d,x,, dyx2,---,d,x,} is a basis for K. 


We show that K has a basis of the described form, which will show that K is free abelian 
of rank at most n. Suppose Y = {3,,---, ¥,} is a basis for G. All nonzero elements in 
K can be expressed in the form 


kiy) +--+ + Rayan; 


where some |k;| is nonzero. Among all bases Y for G, select one Y; that yields the 
minimal such nonzero value {k;| as all nonzero elements of K are written in terms of the 
basis elements in Y;. By renumbering the elements of Y, if necessary, we can assume 
there is w; € K such that 


wy =dyyy + hoya tee + kya 


where d, > 0 and d, is the minimal attainable coefficient as just described. Using the 
division algorithm, we write k; = d\q; +r; whereO <r; < d, for j =2,---.n. Then 


Wy = A(1 + Goy2 tet Gn¥n) Hroy2 Fe Han. (1) 


Now let x; = yy + gay2 +--+ +4n¥n. By Theorem 38.9 {x1, y2, +--+. yp} is also a ba- 
sis for G. From Eq. (1) and our choice of Y, for minimal coefficient d,, we see that 
ry ++ cer, = 0. Thus d,x, € K. 
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We now consider bases for G of the form {x1, y2,---, y,}. Each element of K can 
be expressed in the form 


hyxy + Koyo +++ + hayn- 


Since d|x; € K, we can subtract a suitable multiple of dx; and then using the mini- 
mality of d, to see that /, is a multiple of d;, we see we actually have kyy2 + +++ + kn Yn 
in K. Among all such bases {x), yo, ---, Yn}, we choose one Y> that leads to some k; #0 
of minimal magnitude. (It is possible all k; are always zero. In this case, K is generated 
by d;x; and we are done.) By renumbering the elements of Y) we can assume that there 
is wo € K such that 


W = dyyo ess +hada 


where d < 0 and d, is minimal as just described. Exactly as in the preceding paragraph, 
we can modify our basis from Y>. = {x1, y2,---, Yn} to a basis {x1, x2, ¥3,-°--, Yn} for 
G where dx; € K and d)x. € K. Writing d, = d\q +r forO <r < d,, we see that 
{x1 + GXx2, Xo, Y3,-++, ¥n} is a basis for G, and dyx, + dox2 = di(x1 + gX2) + 1rX2 isin 
K. By our minimal choice of d,, we see r = 0, so d divides dp. 

We now consider all bases of the form {x;, %2, y3,---+, ¥,} for G and examine 
elements of K of the form k3y3 +--- +k, ¥,. The pattern is clear. The process continues 
until we obtain a basis {x1, X2,++-+.Xs, ¥s¢1.°**s Yn} Where the only element of K of 
the form kyi1¥si1 +++: +kn¥n is zero, that is, all k; are zero. We then let x,41; = 
Ve41,''+;Xn = Y, and obtain a basis for G of the form described in the statement of 
Theorem 38.11. Sd 


Every finitely generated abelian group is isomorphic to a group of the form 
Zim, X Lim, X +++ X Lm, XLXGX+-+x Z, 
where m; divides m;11 fori = 1,---,r—1. 


For the purposes of this proof, it will be convenient to use as notations Z/1Z = Z/Z ~ 
Z, = {0}. Let G be finitely generated by n elements. Let F =Zx Zx.---xZforn 
factors. Consider the homomorphism ¢ : F — G of Theorem 38.8, and let K be the 
kernel of this homomorphism. Then there is a basis for F of the form {x1,---, Xn}, 
where {d1x;,---,dsxs} is a basis for K and d; divides d;., fori = 1,---,s —1. By 
Theorem 14.11, G is isomorphic to F/K. But 


P/K ~(Z4xKZx%--+x B(hZ x dhZx:+--x dsZ x {OQ} x «++ x {O}) 
= Za, XxX Lg, Xs K Lg XOX: KZ, 


It is possible that dj = 1, in which case Zg, = {0} and can be dropped (up to 
isomorphism) from this product. Similarly, dz may be 1, and so on. We let my be the first 
d; > 1, m be the next d;, and so on, and our theorem follows at once. 5 


We have demonstrated the toughest part of the Fundamental Theorem (Theorem 
11.12). Of course, a prime-power decomposition exists since we can break the groups 
Zm, into prime-power factors. The only remaining part of Theorem 11.12 concerns the 
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uniqueness of the Betti number, of the torsion coefficients, and of the prime powers. The 
Betti number appears as the rank of the free abelian group G/T, where T is the torsion 
subgroup of G. This rank is invariant by Theorem 38.6 which shows the uniqueness of 
the Betti number. The uniqueness of the torsion coefficients and of prime powers is a 
bit more difficult to show. We give some exercises that indicate their uniqueness (see 
Exercises 14 through 272). 


@ EXERCISES 38 


Computations 


1, Find a basis {(a), a, a3), (by, bo, b3), (c}, €2, c3)} for Z x Z x Z with all aj # 0, all b; x 0, and all Cj x 0. 
(Many answers are possible.) 


2. Is {(2, 1), G, 1)} a basis for Z x Z? Prove your assertion. 
3. Is {(2, 1), (4, 1)} a basis for Z x Z? Prove your assertion. 
4. Find conditions on a, b,c, d € Z for {(a, b), (c, d)} to be a basis for Z x Z. [Hint: Solve x(a, b) + y(c, d) = 
(e, f) in R, and see when the x and y lie in Z.] 
Concepts 


In Exercises 5 and 6, correct the definition of the italicized term without reference to the text, if correction is needed, 
so that it is in a form acceptable for publication. 


5. The rank of a free abelian group G is the number of elements in a generating set for G. 

6. A basis for a nonzero abelian group G is a generating set X C G such that nyxy + nax2 + +++ + tm Xm =O for 
distinct x; € X andn; € Zonly ifn, =nz =--- =n, = 0. 

7. Show by example that it is possible for a proper subgroup of a free abelian group of finite rank r also to have 
rank r. 

8. Mark each of the following true or false. 

a. Every free abelian group is torsion free. 

b. Every finitely generated torsion-free abelian group is a free abelian group. 

c. There exists a free abelian group of every positive integer rank. 


d. A finitely generated abelian group is free abelian if its Betti number equals the number of elements 
in some generating set. 


e. If X generates a free abelian group G and X C Y C G, then Y generates G. 

_____ f. If X is a basis for a free abelian group G and X C Y CG, then Y is a basis for G. 

. Every nonzero free abelian group has an infinite number of bases. 

. Every free abelian group of rank at least 2 has an infinite number of bases. 

. If K is a nonzero subgroup of a finitely generated free abelian group, then K is free abelian. 

. If K is anonzero subgroup of a finitely generated free abelian group, then G/K is free abelian. 


Cae! i) 


Theory 
9. Complete the proof of Theorem 38.5 (See the two sentences preceding the theorem). 


10. Show that a free abelian group contains no nonzero elements of finite order. 
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11. Show that if G and G’ are free abelian groups, then G x G’ is free abelian. 


12, Show that free abelian groups of finite rank are precisely the finitely generated abelian groups containing no 
nonzero elements of finite order. 


13. Show that Q under addition is not a free abelian group. [Hint: Show that no two distinct rational numbers n/m 
and r/s could be contained in a set satisfying Condition 2 of Thorem 38.1.] 


Exercises 14 through 19 deal with showing the uniqueness of the prime powers appearing in the prime-power 
decomposition of the torsion subgroup T of a finitely generated abelian group. 


14. Let p be a fixed prime. Show that the elements of T having as order some power of p, together with zero, form 
a subgroup 7, of T. 


15. Show that in any prime-power decomposition of T,, the subgroup T, in the preceding exercise is isomorphic 
to the direct product of those cyclic factors of order some power of the prime p. [This reduces our problem 
to showing that the group 7, cannot have essentially different decompositions into products of cyclic 
groups.] 

16. Let G be any abelian group and let n be any positive integer. Show that G[n] = {x € G|nx = 0} is a subgroup 
of G. (In multiplicative notation, G[n] = {x € G|x” =e}.) 


17. Referring to Exercise 16, show that Z,-[p] x Z, for any r > 1 and prime p. 
18, Using Exercise 17, show that 


(Zn X Zpn X +++ X Lym Mp] ~ Zp x Zp X--- x Zp 


m factors 


provided each 7; > 1. 


19. Let G be a finitely generated abelian group and T, the subgroup defined in Exercise 14. Suppose T, “ Z,n x 
Zip X +++ X Lym = Lp Zpn X +++ X Lpn, where 1<ry Sr S-+-+ Sry and 1 <5) <5. <--- < 5,. We 
need to show that m =n andr; = 5s; fori = 1,---,n to complete the demonstration of uniqueness of the 
prime-power decomposition. 


a. Use Exercise 18 to show that n =m. 

b. Show r; = s;. Suppose 7; = s; for alli < 7. Show r; = s;, which will complete the proof. [Hint: Suppose 
r; <s;. Consider the subgroup p’'T, = {p’'x|x € T,}, and show that this subgroup would then have 
two prime-power decompositions involving different numbers of nonzero factors. Then argue that this is 
impossible by part (a) of this exercise. ] 


Let T be the torsion subgroup of a finitely generated abelian group. Suppose T ~ Zn, X Zn) X +++ X Zm, & 
Zn, X Zn, X +++ X Zn, where m; divides m;,, fori =1,-+-,7 —1, andn; divides nj4; forn =1,---,s—1, 
and m, > land n, > 1. We wish to show thatr = s and m, = n, fork =1,---,7, demonstrating the uniqueness 
of the torsion coefficients. This is done in Exercises 20 through 22. 


20. Indicate how a prime-power decomposition can be obtained from a torsion-coefficient decomposition. (Observe 
that the preceding exercises show the prime powers obtained are unique.) 


21. Argue from. Exercise 20 that m, and n, can both be characterized as follows. Let pj,---, p; be the distinct 
primes dividing |7|, and let Pi" yrtts py be the highest powers of these primes appearing in the (unique) 
prime-power decomposition. Then m, =n, = pi" Be 224 ae 

22. Characterize m,_; and n;_1, showing that they are equal, and continue to show m,_; =ns—1 fori =1,---, 


r—i,andthenr =s. 


39.1 Example 


39.2 Example 
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FREE GROUPS 


In this section and Section 40 we discuss a portion of group theory that is of great interest 
not only in algebra but in topology as well. In fact, an excellent and readable discussion 
of free groups and presentations of groups is found in Crowell and Fox [46, Chapters 3 
and 4]. 


Words and Reduced Words 


Let A be any (not necessarily finite) set of elements a; fori € 7. We think of A as an 
alphabet and of the a; as letters in the alphabet. Any symbol of the forma,” withn € Z 
is a syllable and a finite string w of syllables written in juxtaposition is a word. We also 
introduce the empty word 1, which has no syllables. 


Let A = {a, a2, a3}. Then 


aya; *aya3, apa; ‘asaza;’, and a, 
are all words, if we follow the convention of understanding that a;' is the same as qj. 


A 


There are two natural types of modifications of certain words, the elementary 
contractions. The first type consists of replacing an occurrence of a;"a;" in a word by 
a; The second type consists of replacing an occurrence of a;° in a word by 1, that 
is, dropping it out of the word. By means of a finite number of elementary contractions, 
every word can be changed to a reduced word, one for which no more elementary 
contractions are possible. Note that these elementary contractions formally amount to 
the usual manipulations of integer exponents. 


1 


The reduced form of the word a,?a2~!a3a17a,~" of Example 39.1 is ay?a3a,~>. A 


It should be said here once and for all that we are going to gloss over several points 
that some books spend pages proving, usually by complicated induction arguments broken 
down into many cases. For example, suppose we are given a word and wish to find its 
reduced form. There may be a variety of elementary contractions that could be performed 
first. How do we know that the reduced word we end up with is the same no matter in 
what order we perform the elementary contractions? The student will probably say this is 
obvious. Some authors spend considerable effort proving this. The author tends to agree 
here with the student. Proofs of this sort he regards as tedious, and they have never made 
him more comfortable about the situation. However, the author is the first to acknowledge 
that he is not a great mathematician. In deference to the fact that many mathematicians feel 
that these things do need considerable discussion, we shall mark an occasion when we just 
state such facts by the phrase, “It would seem obvious that,” keeping the quotation marks. 


Free Groups 


Let the set of all reduced words formed from our alphabet A be F[A]. We now make 
FA] into a group in a natural way. For w, and w2 in F[A], define w, - w2 to be the 
reduced form of the word obtained by the juxtaposition ww» of the two words. 
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If 
wy = aya Pas” 


and 


then w1 - wo = ay? a) aan. A 


“Tt would seem obvious that” this operation of multiplication on F [A] is well defined 
and associative. The empty word 1| acts as an identity element. “It would seem obvious 
that” given a reduced word w € F[A], if we form the word obtained by first writing the 
syllables of w in the opposite order and second by replacing each a;” by a;~”, then the 
resulting word w—! is a reduced word also, and 


The group F'[A] just described is the free group generated by A. | 


Look back at Theorem 7.6 and the definition preceding it to see that the present use 
of the term generated is consistent with the earlier use. 

Starting with a group G and a generating set {a; |i € 1} which we will abbreviate by 
{a;}, we might ask if G is free on {a;}, that is, if G is essentially the free group generated 
by {a;}. We define precisely what this is to mean. 


If G is a group with a set A = {a;} of generators, and if G is isomorphic to F[A] under 
amap 6: G — F|A] such that ¢(a;) = a;, then G is free on A, and the a; are free 
generators of G. A group is free if it is free on some nonempty set A. a 


The only example of a free group that has occurred before is Z, which is free on one 
generator. Note that every free group is infinite. A 


Refer to the literature for proofs of the next three theorems. We will not be using 
these results. They are stated simply to inform us of these interesting facts. 


If a group G is free on A and also on B, then the sets A and B have the same number 
of elements; that is, any two sets of free generators of a free group have the same 


cardinality. 


If G is free on A, the number of elements in A is the rank of the free group G. a 


Actually, the next theorem is quite evident from Theorem 39.7. 
Two free groups are isomorphic if and only if they have the same rank. 


A nontrivial proper subgroup of a free group is free. 


39.11 Example 


39.12 Theorem 


Proof 


39.13 Theorem 
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Let F[{x, y}] be the free group on {x, ¥}. Let 
Ye = xk yx* 
for k > 0. The y;, for k > 0 are free generators for the subgroup of F'[{x, y}] that they 


generate. This illustrates that although a subgroup of a free group is free, the rank of the 
subgroup may be much greater than the rank of the whole group! A 


Homomorphisms of Free Groups 


Our work in this section will be concerned primarily with homomorphisms defined on a 
free group. The results here are simple and elegant. 


Let G be generated by A = {a; |i € 7} and let G’ be any group. If a;’ for i € J are 
any elements in G’, not necessarily distinct, then there is at most one homomorphism 
@:G-— G’ such that $(a;) = a;'. If G is free on A, then there is exactly one such 
homomorphism. 


Let ¢ be a homomorphism from G into G’ such that }(a;) = a;'. Now by Theorem 7.6, 
for any x € G we have 
x= Il a 
i 


for some finite product of the generators a;, where the a;, appearing in the product need 
not be distinct. Then since ¢ is a homomorphism, we must have 


o(x) = | [o(a,”) = I] (aa 
j J 
Thus a homomorphism is completely determined by its values on elements of a generating 
set. This shows that there is at most one homomorphism such that $(q;) = a;’. 
Now suppose G is free on A; that is, G = F[A]. For 


x= [ Ja,” 
Jj 
in G, define y : G > G' by 


wx) =[[(@,")”. 


J 
The map is well defined, since F [A] consists precisely of reduced words; no two different 
formal products in F'[A] are equal. Since the rules for computation involving exponents 
in G’ are formally the same as those involving exponents in G, it is clear that wW(xy) = 
w(x)w(y) for any elements x and y in G, so w is indeed a homomorphism. . 


Perhaps we should have proved the first part of this theorem earlier, rather than 
having relegated it to the exercises. Note that the theorem states that a homomorphism of 
a group is completely determined if we know its value on each element of a generating 
set. This was Exercise 46 of Section 13. In particular, a homomorphism of a cyclic group 
is completely determined by its value on any single generator of the group. 


Every group G’ is a homomorphic image of a free group G. 
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Let G’ = {a;’|i € I}, andlet A = {a; |i € 1} be a set with the same number of elements 
as G’. Let G = F[A]. Then by Theorem 39.12 there exists a homomorphism y mapping 
G into G’ such that w(a;) = a;’. Clearly the image of G under 7 is all of G’. Sd 


Another Look at Free Abelian Groups 


It is important that we do not confuse the notion of a free group with the notion of 
a free abelian group. A free group on more than one generator is not abelian. In the 
preceding section, we defined a free abelian group as an abelian group that has a basis, 
that is, a generating set satisfying properties described in Theorem 38.1. There is another 
approach, via free groups, to free abelian groups. We now describe this approach. 

Let F[A] be the free group on the generating set A. We shall write F in place of 
F[A] for the moment. Note that F is not abelian if A contains more than one element. 
Let C be the commutator subgroup of F. Then F/C is an abelian group, and it is not 
hard to show that F/C is free abelian with basis {aC |a € A}. If aC is renamed a, we 
can view F'/C as a free abelian group with basis A. This indicates how a free abelian 
group having a given set as basis can be constructed. Every free abelian group can be 
constructed in this fashion, up to isomorphism. That is, if G is free abelian with basis 
X, form the free group F[X], form the factor group of F[X] modulo its commutator 
subgroup, and we have a group isomorphic to G. 

Theorems 39.7, 39.9, and 39.10 hold for free abelian groups as well as for free 
groups. In fact, the abelian version of Theorem 39.10 was proved for the finite rank 
case in Theorem 38.11. In contrast to Example 39.11 for free groups, it is true that for 
a free abelian group the rank of a subgroup is at most the rank of the entire group. 
Theorem 38.11 also showed this for the finite rank case. 


% EXERCISES 39 


Computations 


1. Find the reduced form and the inverse of the reduced form of each of the following words. 
a @b Rac ctb b. aa Sbiatctc2aa} 

. Compute the products given in parts (a) and (b) of Exercise 1 in the case that {a, b, c} is a set of generators 
forming a basis for a free abelian group. Find the inverse of these products. 


. How many different homomorphisms are there of a free group of rank 2 into 


a. Z4? b. Ze? ec. S39 
. How many different homomorphisms are there of a free group of rank 2 onto 

a. Z4? b. Ze? ce. 53? 
. How many different homomorphisms are there of a free abelian group of rank 2 into 

a. Z4? b. Ze? ec. $3? 
. How many different homomorphisms are there of a free abelian group of rank 2 onto 

a. Za? b. Ze? c. $3? 
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Concepts 


In Exercises 7 and 8, correct the definition of the italicized term without reference to the text, if correction is needed, 
so that it is in a form acceptable for publication. 


7. A reduced word is one in which there are no appearances in juxtaposition of two syllables having the same 
letter and also no appearances of a syllable with exponent 0. 


8. The rank of a free group is the number of elements in a set of generators for the group. 


9. Take one of the instances in this section in which the phrase “It would seem obvious that” was used and discuss 
your reaction in that instance. 


10. Mark each of the following true or false. 


| 


. Every proper subgroup of a free group is a free group. 

. Every proper subgroup of every free abelian group is a free group. 

. A homomorphic image of a free group is a free group. 

. Every free abelian group has a basis. 

. The free abelian groups of finite rank are precisely the finitely generated abelian groups. 
. No free group is free. 

. No free abelian group is free. 

. No free abelian group of rank >1 is free. 


Sr mo ao om hb 


i, Any two free groups are isomorphic. 
j. Any two free abelian groups of the same rank are isomorphic. 


Theory 


11. Let G be a finitely generated abelian group with identity 0. A finite sct {b),---, b,}, where b; € G, is a basis 
for G if {bi,---, b,} generates G and )~"_, m,;b; = O if and only if each m;b; = 0, where m; € Z. 


aa) 


d. 


. Show that {2, 3} is not a basis for Z4. Find a basis for Z,. 
. Show that both {1} and {2, 3} are bases for Ze. (This shows that for a finitely generated abelian group G 


with torsion, the number of elements in a basis may vary; that is, it need not be an invariant of the group 
G.) 


. Is a basis for a free abelian group as we defined it in Section 38 a basis in the sense in which it is used in 


this exercise? 
Show that every finite abelian group has a basis {by, --- , b,}, where the order of b; divides the order of b; +1. 


In present-day expositions of algebra, a frequently used technique (particularly by the disciples of N. Bourbaki) for 
introducing a new algebraic entity is the following: 


1. 
2. 


3. 


Describe algebraic properties that this algebraic entity is to possess. 


Prove that any two algebraic entities with these properties are isomorphic, that is, that these 
properties characterize the entity. 


Show that at least one such entity exists. 


The next three exercises illustrate this technique for three algebraic entities, each of which we have met before. 
So that we do not give away their identities, we use fictitious names for them in the first two exercises. The last part 
of these first two exercises asks us to give the usual name for the entity. 
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12. Let G be any group. An abelian group G* is a blip group of G if there exists a fixed homomorphism ¢ of G 
onto G* such that each homomorphism yw of G into an abelian group G’ can be factored as yy = 0, where 0 
is ahomomorphism of G* into G’ (see Fig. 39.14). 


13. 


a. 


Show that any two blip groups of G are isomorphic. [Hint: Let G;* and G,* be two blip groups of G. 
Then each of the fixed homomorphisms ¢; : G > G,* and ¢, : G + G,* can be factored via the other 
blip group according to the definition of a blip group; that is, 6; = 6;@2 and ¢@2 = 0.¢,. Show that 6 1s an 
isomorphism of G2* onto G;* by showing that both 6,6) and 96, are identity maps.] 


b. Show for every group G that a blip group G* of G exists. 


. What concept that we have introduced before corresponds to this idea of a blip group of G? 


v u i 
G isa Com » G 
> ra NS fe 
a G 


39.14 Figure 39.15 Figure 


Let S be any set. A group G together with a fixed function g : 5 > G constitutes a blop group on S if for 
each group G’ and map f : S — G’ there exists a unique homomorphism @, of G into G’ such that f = org 
(see Fig. 39.15). 


a. 


Let S be a fixed set. Show that if both G), together with g; : S > Gi, and Go, together with g. : S Go, 
are blop groups on S, then G; and G2 are isomorphic. [Hint: Show that g; and gz are one-to-one maps and 
that gS and g)5 generate G, and G2, respectively. Then proceed in a way analogous to that given by the 
hint for Exercise 12.] 


. Let S be a set. Show that a blop group on S exists. You may use any theorems of the text. 
. What concept that we have introduced before corresponds to this idea of a blop group on S$? 


14. Characterize a free abelian group by properties in a fashion similar to that used in Exercise 13. 


Group PRESENTATIONS 


Definition 


Following most of the literature on group presentations, in this section we let 1 be the 
identity of a group. The idea of a group presentation is to form a group by giving a set of 
generators for the group and certain equations or relations that we want the generators 
to satisfy. We want the group to be as free as it possibly can be on the generators, subject 
to these relations. 


40.1 Example Suppose G has generators x and y and is free except for the relation xy = yx, which 


we may express as xyx 'y ! = 1. Note that the condition xy = yx is exactly what 
is needed to make G abelian, even though xyx~!y~! is just one of the many possible 
commutators of F[{x, y}]. Thus G is free abelian on two generators and is isomorphic to 
F[{x, y}] modulo its commutator subgroup. This commutator subgroup of F[{x, y}] 
is the smallest normal subgroup containing xyx~'y—!, since any normal subgroup 


Section 40 Group Presentations 347 


containing xyx—'y—! gives rise to a factor group that is abelian and thus contains the 
commutator subgroup by Theorem 15.20. A 


The preceding example illustrates the general situation. Let F[A] be a free group 
and suppose that we want to form a new group as much like F[A] as it can be, subject to 
certain equations that we want satisfied. Any equation can be written in a form in which 
the right-hand side is 1. Thus we can consider the equations to ber; = 1 fori € J, where 
r; € F[A]. If we require that r; = 1, then we will have to have 


ate ae =1 


for any x € F[A] andn é Z. Also any product of elements equal to 1 will again have to 
equal 1. Thus any finite product of the form 


GES (i;")x7", 
i 


where the r;, need not be distinct, will have to equal | in the new group. It is readily 
checked that the set of all these finite products is a normal subgroup R of F [A]. Thus any 
group looking as much as possible like FLA], subject to the requirements r; = 1, also has 
r =1foreveryr € R. But F[A]/R looks like F[A] (emember that we multiply cosets 
by choosing representatives), except that R has been collapsed to form the identity 1. 
Hence the group we are after is (at least isomorphic to) F[A]/R. We can view this group 
as described by the generating set A and the set {r; |i € 7}, which we will abbreviate 


{ri}. 


m@ HISTORICAL NOTE 


he idea of a group presentation already ap- 

pears in Arthur Cayley’s 1859 paper, “On the 
Theory of Groups as Depending on the Symbolic 
Equation 6” = 1. Third Part.” In this article, Cayley 
gives a complete enumeration of the five groups of 
order 8, both by listing all the elements of each 
and by giving for each a presentation. For exam- 
ple, his third example is what is here called the 
octic group; Cayley notes that this group is gener- 
ated by the two elements a, 6 with the relations 
ot = 1, 6B? = 1,@f = Ba’. He also shows more 
generally that a group of order mn is generated by 
a, B with the relations 7” = 1, 8” = 1,@8 = Bo* 
if and only if s” = 1 (mod m) (see Exercise 13). 

In 1878, Cayley returned to the theory of groups 
and noted that a central problem in that theory is the 


determination of all groups of a given order n. In the 
early 1890s, Otto Hélder published several papers 
attempting to solve Cayley’s problem. Using tech- 
niques similar to those discussed in Sections 36, 
37, and 40, Hélder determined all simple groups 
of order up to 200 and characterized all the groups 
of orders p>, pq”, pgr, and p*, where p,q,r are 
distinct prime numbers. Furthermore, he developed 
techniques for determining the possible structures 
of a group G, if one is given the structure of a nor- 
mal subgroup H and the structure of the factor group 
G/H. Interestingly, since the notion of an abstract 
group was still fairly new at this time, Hdlder typi- 
cally began his papers with the definition of a group 
and also emphasized that isomorphic groups are es- 
sentially one and the same object. 


40.2 Definition Let A beasetand let {r;} C F[A]. Let R be the least normal subgroup of F [A] containing 
the r;. An isomorphism ¢ of F[A]/R onto a group G is a presentation of G. The sets 
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A and {r;} give a group presentation. The set A is the set of generators for the 
presentation and each r; is a relator. Eachr < R is a consequence of {7;}. An equation 
r; = 1 is a relation. A finite presentation is one in which both A and {r;} are finite 
sets. | 


This definition may seem complicated, but it really is not. In Example 40.1, {x, y} 
is our set of generators and xyx—!y7! is the only relator. The equation xyx—!y—! = 1, 
or xy = yx, is arelation. This was an example of a finite presentation. 

If a group presentation has generators x; and relators r;, we shall use the notations 


(x; 27) or (xjin =) 


to denote the group presentation. We may refer to F[{x;}]/R as the group with presen- 
tation (xj : Ti). 


Isomorphic Presentations 
Consider the group presentation with 
A={a} and = {r;} = {a}, 
that is, the presentation 
(a: a= 1). 


This group defined by one generator a, with the relation a® = 1, is isomorphic to Ze. 
_ Now consider the group defined by two generators a and b, with a= 1, = |, 
and ab = ba, that is, the group with presentation 


(a,b: a’, b’, aba“'b7'), 


The condition a? = 1 gives a“! = a. Also b? = 1 gives b~' = b?. Thus every element 


in this group can be written as a product of nonnegative powers of a and b. The relation 
aba~'b— = 1, that is, ab — ba, allows us to write first all the factors involving a and 
then the factors involving b. Hence every element of the group is equal to some ab". 
But then a? = 1 and b° = 1 show that there are just six distinct elements, 


1, b, b*, a, ab, ab’. 


Therefore this presentation also gives a group of order 6 that is abelian, and by the 
Fundamental Theorem 11.12, it must again be cyclic and isomorphic to Z¢. A 


The preceding example illustrates that different presentations may give isomor- 
phic groups. When this happens, we have isomorphic presentations. To determine 
whether two presentations are isomorphic may be very hard. It has been shown (see 
Rabin [22]) that a number of such problems connected with this theory are not generally 
solvable; that is, there is no routine and well-defined way of discovering a solution in all 
cases. These unsolvable problems include the problem of deciding whether two presen- 
tations are isomorphic, whether a group given by a presentation is finite, free, abelian, 
or trivial, and the famous word problem of determining whether a given word w is a 
consequence of a given set of relations {7;}. 


40.4 Example 


40.5 Example 
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The importance of this material is indicated by our Theorem 39.13, which guarantees 
that every group has a presentation. 


Let us show that 
Gopi ray oy So) 


is a presentation of the trivial group of one element. We need only show that x and v 
are consequences of the relators y?2xy~! and yx?yx~!, or that x = 1 and y = 1 can be 
deduced from y*x = y and yx*y = x. We illustrate both techniques. 

As a consequence of y*xy—', we get yx upon conjugation by y~!. From yx we 
deduce x~ly7!, and then (x~!y7!)(yx?yx7) gives xyx7!. Conjugating xyx7! by x7}, 
we get y. From y we get y~!, and y!(yx) is x. 

Working with relations instead of relators, from y*x = y we deduce yx = 1 upon 
multiplication by y~! on the left. Then substituting yx = 1 into yx?y =x, that is, 
(yx)(xy) = x, we get xy =x. Then multiplying by x~! on the left, we have y = 1. 
Substituting this in yx = 1, we getx = 1. 

Both techniques amount to the same work, but it somehow seems more natural to 
most of us to work with relations. A 


Applications 


We conclude this chapter with two applications. 


Let us determine all groups of order 10 up to isomorphism. We know from the Funda- 
meatal Theorem 11.12 that every abelian group of order 10 is isomorphic to Z)9. Suppose 
that G is nonabelian of order 10. By Sylow theory, G contains a normal subgroup 
of order 5, and H must be cyclic. Let a be a generator of H. Then G/A is of order 2 
and thus isomorphic to Z2. If b € G and b ¢ H, we must then have b? € H. Since every 
element of H except 1 has order 5, if b? were not equal to 1, then b* would have order 
5, so b would have order 10. This would mean that G would be cyclic, contradicting our 
assumption that G is not abelian. Thus b? = 1. Finally, since H is a normal subgroup of 
G, bHb"' = H, soinparticular, bab! € H. Since conjugation by b is an automorphism 
of H, bab~ must be another element of H of order 5, hence bab™! equals a, a?, a’, or 
a’. But bab-! =a would give ba = ab, and then G would be abelian, since a and b 
generate G. Thus the possibilities for presentations of G are: 


1. (a,b:@ =1,8 =1,ba=a’*d), 
2. (a,b:@=1,b* =1,ba =a’*b), 
3. (a4,b:@ =1,b7 =1, ba =a‘b). 
Note that all three of these presentations can give groups of order at most 10, since 


the last relation ba = a'b enables us to express every product of a’s and b’s in G in the 
form a°b', Then a? = 1 and b? = 1 show that the set 


S = {a°b®, alb®, a*b®, ab®, atb®, a°b!, a'b!, a*b!, atb!, ato} 


includes all elements of G. 
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It is not yet clear that all these elements in S are distinct, so that we have in all three 
cases a group of order 10. For example, the group presentation 


(a,b:a =1,b* =1,ba =a’b) 
gives a group in which, using the associative law, we have 
a = b’a = (bb)a = b(ba) = b(a*b) = (ba)(ab) 
= (a*b)(ab) = a*(bayb = a?(a*b)b = ab? = at 


Thus in this group, a — a*, so a? = 1, which, together with a° — 1, yields a — 1. But 


a@ = 1, together with a? — 1, means that a = 1. Hence every element in the group with 
presentation 


(a,b: a° =1,b? =1, ba = ab) 
is equal to either 1 or 5; that is, this group is isomorphic to Z). A similar study of 
(bb)a — b(ba) 
for 
(a,b: a° =1,b* =1, ba =a°d) 


shows that a = a‘ again, so this also yields a group isomorphic to Z». 
This leaves just 


(a,b:a° =1,b? =1, ba =a*b) 


as a candidate for a nonabelian group of order 10. In this case, it can be shown that 
all elements of § are distinct, so this presentation does give a nonabelian group G of 
order 10. How can we show that all elements in S$ represent distinct elements of G? 
The easy way is to observe that we know that there is at least one nonabelian group of 
order 10, the dihedral group Ds. Since G is the only remaining candidate, we must have 
G ~ Ds. Another attack is as follows. Let us try to make S$ into a group by defining 
(a*b')\(a"b”) to be a*b”, where x is the remainder of s + u(4‘) when divided by 5, and 
y is the remainder of t + v when divided by 2, in the sense of the division algorithm 
(Theorem 6.3). In other words, we use the relation ba = a*h as a guide in defining the 
product (a‘°b')(a“b’) of two elements of S. We see that a°b° acts as identity, and that 
given a“b”, we can determine f and s successively by letting 


t = —v (mod 2) 
and then 
= —u(4')(mod 5), 


giving a*b', which is a left inverse for a“b”. We will then have a group structure on S if 
and only if the associative law holds. Exercise 13 asks us to carry out the straight-forward 
computation for the associative law and to discover a condition for S to be a group under 
such a definition of multiplication. The criterion of the exercise in this case amounts to 
the valid congruence 


4? = 1 (mod 5). 


40.6 Example 
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Thus we do get a group of order 10. Note that 
2’ # 1 (mod 5) 
and 
3? # 1 (mod 5), 
so Exercise 13 also shows that 
(a,b:a° =1,b* =1, ba =a’b) 
and 
(a,b: a° =1,b? =1, ba =a°b) 
do not give groups of order 10. A 


Let us determine all groups of order 8 up to isomorphism. We know the three abelian 
ones: 


Ze, Z2 x Zia, Zo x Ly x Zo. 


Using generators and relations, we shall give presentations of the nonabelian groups. 

Let G be nonabelian of order 8. Since G is nonabelian, it has no elements of order 8, 
so each element but the identity is of order either 2 or 4. If every element were of order 
2, then for a, b € G, we would have (ab)? = 1, that is, abab = 1. Then since a? = 1 
and b* = 1 also, we would have 


ba = a’bab? = a(ab)*b = ab, 


contrary to our assumption that G is not abelian. Thus G must have an element of 
order 4. 

Let (a) be a subgroup of G of order 4. If b ¢ (a), the cosets (a) and b{a) exhaust all 
of G. Hence a and b are generators for G and a* = 1. Since (a) is normal in G (by Sylow 
theory, or because it is of index 2), G/(a) is isomorphic to Zy and we have b* € (a). If 
b? =a orb? =a’, then b would be of order 8. Hence b? = 1 or b* = a’. Finally, since 
(a) is normal, we have bab7! € (a), and since b(a)b~! is a subgroup conjugate to (a) 
and hence isomorphic to (a), we see that bab~! must be an element of order 4. Thus 
bab~! = a or bab~' = a>. If bab were equal to a, then ba would equal ab, which 
would make G abelian. Hence bab! = a3, so ba = a*b. Thus we have two possibilities 
for G, namely, 


G,: (a,b: a* =1,b? =1, ba =a*b) 
and 
Go:(a,b:a4=1,0 =a’, ba = ab). 
Note that a7! = a3, and that b~! is b in G, and b? in G>. These facts, along with 
the relation ba = a*b, enable us to express every element in G; in the form ab", as 


in Examples 40.3 and 40.5. Since a* = 1 and either b? = 1 or b* = a’, the possible 
elements in each group are 


1, a, a’, a, b, ab, ab, arb. 
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Thus G, and G» each have order at most 8. That G; is a group of order 8 can be seen 
from Exercise 13. An argument similar to that used in Exercise 13 shows that G2 has 
order 8 also. 

Since ba = a*b % ab, we see that both G; and G» are nonabelian. That the two 
groups are not isomorphic follows from the fact that a computation shows that G, has 
only two elements of order 4, namely, a and a?. On the other hand, in G2 all elements 
but 1 and a? are of order 4. We leave the computations of the tables for these groups 
to Exercise 3. To illustrate suppose we wish to compute (a7b)(a*b). Using ba = a°b 
repeatedly, we get 


(a*b)(a*b) = a*(ba)a*b = a*(ba)ab = a*(ba)b = ald’. 
Then for G,, we have 
a''p? = qi = a, 
but if we are in G2, we get 
ap? =a =a. 


The group G; is the octic group and is isomorphic to our old friend, the group D4 
of symmetries of the square. The group G) is the quaternion group; it is isomorphic to 
the multiplicative group {1, —1,i, —i, 7, —j, k, —k} of quaternions. Quaternions were 
discussed in Section 24. A 
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Computations 
1. Give a presentation of Z4 involving one generator; involving two generators; involving three generators. 
2. Give a presentation of 53 involving three generators. 


3. Give the tables for both the octic group 
Gbig =1,b 1, ba a*d) 
and the quaternion group 
(a,b: a* =1,b? =a’, ba =a’b). 


In both cases, write the elements in the order 1, a, a?, a°, b, ab, a*b, a>b. (Note that we do not have to com- 
pute every product. We know that these presentations give groups of order 8, and once we have computed 
enough products the rest are forced so that each row and each column of the table has each element exactly 
once.) 

4. Determine all groups of order 14 up to isomorphism. [Hint: Follow the outline of Example 40.5 and use 
Exercise 13, part (b).] 

5. Determine all groups of order 21 up to isomorphism. [Hint: Follow the outline of Example 40.5 and use 
Exercise 13, part (b). It may seem that there are two presentations giving nonabelian groups. Show that they 
are isomorphic. ] 
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Concepts 


In Exercises 6 and 7, correct the definition of the italicized term without reference to the teat, if correction is needed. 
so that it is in a form acceptable for publication. 


6. 
7. 


A consequence of the set of relators is any finite product of relators raised to powers. 


Two group presentations are isomorphic if and only if there is a one-to-one correspondence of the generators 
of the first presentation with the generators of the second that yields, by renaming generators, a one-to-one 
correspondence of the relators of the first presentation with those of the second. 


. Mark each of the following true or false. 


a. Every group has a presentation. 
b. Every group has many different presentations. 


. Every group has two presentations that are not isomorphic. 


. Every group with a finite presentation is of finite order. 
. Every cyclic group has a presentation with just one generator. 


c. 
d. Every group has a finite presentation. 
€ 
f. 


. Every conjugate of a relator is a consequence of the relator. 


g 
h. Two presentations with the same number of generators are always isomorphic. 


i. Inapresentation of an abelian group, the set of consequences of the relators contains the commutator 
subgroup of the free group on the generators. 


j. Every presentation of a free group has 1 as the only relator. 


Theory 


9. 


10. 


11. 


12. 


13. 


Use the methods of this section and Exercise 13, part (b), to show that there are no nonabelian groups of order 
15. (See also Example 37.10). 


Show, using Exercise 13, that 

(a,b:a2 =1,b? =1,ba =a’b) 
gives a group of order 6. Show that it is nonabelian. 
Show that the presentation 

Gubag Hj 1h = ybe Sad) 


of Exercise 10 gives (up to isomorphism) the only nonabelian group of order 6, and hence gives a group 
isomorphic to $3. 


We showed in Example 15.6 that A, has no subgroup of order 6. The preceding exercise shows that such a 
subgroup of A, would have to be isomorphic to either Ze or 53. Show again that this is impossible by considering 
orders of elements. 


Let 
S={albi|O0<i<m,0<j <n}, 


that is, S consists of all formal products a'b/ starting with a°b° and ending with a”~'b"~!. Let r be a positive 
integer, and define multiplication on S by 


(ab ya"b®) = a°b®, 


where x is the remainder of s + u(r‘) when divided by m, and y is the remainder of t + v when divided by n, 
in the sense of the division algorithm (Theorem 6.3). 
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a. Show that a necessary and sufficient condition for the associative law to hold and for S to be a group under 
this multiplication is that r” = 1 (mod m). 
b. Deduce from part (a) that the group presentation 
(a,b: a" =1,b" =1, ba =a’b) 
gives a group of order mn if and only if r” = 1 (mod m). (See the Historical Note on page xxx.) 


Show that ifn = pq, with p and gq primes and g > p and q = 1 (mod p), then there is exactly one nonabelian 
group (up to isomorphism) of order n. Recall that the ¢ — 1 nonzero elements of Z, forma cyclic group Z,* under 
multiplication modulo gq. [Hint: The solutions of x? = 1 (mod q) form a cyclic subgroup of Z,* with elements 


1,7r,r?,--+,r?-1, In the group with presentation (a, b : a7 = 1, b? = 1, ba = ab), we have bab“! =a’, so 
biab~! = a’. Thus, since b/ generates (b) for j = 1,---, p — 1, this presentation is isomorphic to 


(a, b! :a? =1,(b/)? = 1, (b)a = a” (b/)), 


so all the presentations (a, b : a? = 1, b? = 1, ba = a“”'b) are isomorphic. ] 
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SIMPLICIAL COMPLEXES AND HOMOLOGY GROUPS 


Motivation 


Topology concems sets for which we have enough of an idea of when two points are 
close together to be able to define a continuous function. Two such sets, or topological 
spaces, are structurally the same if there is a one-to-one function mapping one onto the 
other such that both this function and its inverse are continuous. Naively, this means 
that one space can be stretched, twisted, and otherwise deformed, without being torn or 
cut, to look just like the other. Thus a big sphere is topologically the same structure as 
a small sphere, the boundary of a circle the same structure as the boundary of a square, 
and so on. Two spaces that are structurally the same in this sense are homeomorphic. 
Hopefully the student recognizes that the concept of homeomorphism is to topology as 
the concept of isomorphism (where sets have the same algebraic structure) is to algebra. 

The main problem of topology is to find useful, necessary and sufficient conditions, 
other than just the definition, for two spaces to be homeomorphic. Sufficient conditions 
are hard to come by in general. Necessary conditions are a dime a dozen, but some 
are very important and useful. A “nice” space has associated with it various kinds of 
groups, namely homology groups, cohomology groups, homotopy groups, and cohomo- 
topy groups. If two spaces are homeomorphic, it can be shown that the groups of one 
are isomorphic to the corresponding groups associated with the other. Thus a necessary 
condition for spaces to be homeomorphic is that their groups be isomorphic. Some of 
these groups may reflect very interesting properties of the spaces. Moreover, a contin- 
uous mapping of one space into another gives rise to homomorphisms from the groups 


7 Part VII is not required for the remainder of the text. 
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of one into the groups of the other. These group homomorphisms may reflect interesting 
properties of the mapping. 

If the student could make neither head nor tail out of the preceding paragraphs, 
he need not worry. The above paragraphs were just intended as motivation for what 
follows. It is the purpose of this section to describe some groups, homology groups, 
that are associated with certain simple spaces, in our work, usually some subset of the 
familiar Euclidean 3-space R?. 


Preliminary Notions 


First we introduce the idea of an oriented n-simplex in Euclidean 3-space R? for n = 
0, 1, 2,and 3. An oriented 0-simplex is just a point P. An oriented 1-simplex is a directed 
line segment P; P2 joining the points P; and P2 and viewed as traveled in the direction 
from P; to Po. Thus P; P2 4 P:P,. We will agree, however, that P; P) = —P)P,. An 
oriented 2-simplex is a triangular region P, P, P3, as in Fig. 41.1, together with a pre- 
scribed order of movement around the triangle, e.g., indicated by the arrow in Fig. 41.1 
as the order P; P) P;. The order P; P2 P3 is clearly the same order as P) P3P, and P3P) Po, 
but the opposite order from P; P3 P2, P3 PP, and P,P, P3. We will agree that 


P,P) P3 = Po P3P; = P3 P,P) = —P, P3P2 = —P3P2P, = —P2P, Ps. 
Note that P; P; P, is equal to P| P2 P3 if 


1 2 <3 
ij k 


is an even permutation, and is equal to —P; P) P3 if the permutation is odd. The same 
could be said for an oriented 1-simplex P; P). Note also that form = 0, 1, 2, an oriented 
n-simplex is an n-dimensional object. 

The definition of an oriented 3-simplex should now be clear: An oriented 3-simplex 
is given by an ordered sequence P| P2 P3P, of four vertices of a solid tetrahedron, as in 
Fig. 41.2. We agree that P| PoP; P4 = xP; P; P, P,, depending on whether the permuta- 


tion 
123 4 
ijr s 


is even or odd. Similar definitions hold for n > 3, but we shall stop here with dimensions 
that we can visualize. These simplexes are oriented, or have an orientation, meaning 
that we are concerned with the order of the vertices as well as with the actual points 
where the vertices are located. All our simplexes will be oriented, and we shall drop the 


adjective from now on. 
P, AP. P 


41.1 Figure 


2 
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P; 
41.2 Figure 


We are now going to define the boundary of an n-simplex for n = 0, 1,2, 3. The 
term boundary is intuitive. We define the boundary of a 0-simplex P to be the empty 
simplex, which we denote this time by “0.” The notation is 


“ao(P) = 0.” 
The boundary of a 1-simplex P, P, is defined by 
O1(P; P)) = P2 — Pi, 


that is, the formal difference of the end point and the beginning point. Likewise, the 
boundary of a 2-simplex is defined by 


02( P,P; P3) = P,P; — P,P; + P; P2, 


which we can remember by saying that it is the formal sum of terms that we obtain by 
dropping each P; in succession from the 2-simplex P; P;P; and taking the sign to be + 
if the first term is omitted, — if the second is omitted, and + if the third is omitted. 
Referring to Fig. 41.1, we see that this corresponds to going around what we naturally 
would call the boundary in the direction indicated by the orientation arrow. Note also 
that the equation 0,(P; P:) = P2 — P; can be remembered in the same way. Thus we are 
led to the following definition of the boundary of a 3-simplex: 


03( Py P2 P3P4) = P)P3P4 — P, P3P4+ Pi PoP, — P Po P3. 


Similar definitions hold for the definition of 0, for n > 3. Each individual summand of 
the boundary of a simplex is a face of the simplex. Thus, P) P3 P4 is a face of P| Po P3 Ps, 
but P, P3 P, is not a face. However, P; Py P3 = —P, P3P4 is a face of P,P; P3P4. 
Suppose that you have a subset of R° that is divided up “nicely” into simplexes, 
as, for example, the surface S of the tetrahedron in Fig. 41.2, which is split up into four 
2-simplexes nicely fitted together. Thus on the surface of the tetrahedron, we have some 
0-simplexes, or the vertices, of the tetrahedron; some 1-simplexes, or the edges of the 
tetrahedron; and some 2-simplexes, or the triangles of the tetrahedron. In general, for a 
space to be divided up “nicely” into simplexes, we require that the following be true: 


1. Each point of the space belongs to at least one simplex. 
2. Each point of the space belongs to only a finite number of simplexes. 
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41.3 Figure 


3. Two different (up to orientation) simplexes either have no points in common 
or one is (except possibly for orientation) a face of the other or a face of a face 
of the other, etc., or the set of points in common is (except possibly for 
orientation) a face, or a face of a face, etc., of each simplex. 


Condition (3) excludes configurations like those shown in Fig. 41.3. A space divided up 
into simplexes according to these requirements is a simplicial complex. 


Chains, Cycles, and Boundaries 


Let us now describe some groups associated with a simplicial complex X. We shall 
illustrate each definition with the case of the surface S of our tetrahedron in Fig. 41.2. 
The group C,,(X) of (oriented) n-chains of X is the free abelian group generated by the 
(oriented) n-simplexes of X. Thus every element of C,,(X) is a finite sum of the form 
>); m;9;, where the o; are n-simplexes of X and m; € Z. We accomplish addition of 
chains by taking the algebraic sum of the coefficients of each occurrence in the chains 
ef a fixed simplex. 


For the surface S of our tetrahedron, every element of C(S) is of the form 
my Py P3P4 + m2 P; P3 Pq + m3 P; Py Ps + maP; P2P3 
for m; € Z. As an illustration of addition, note that 


(3.P2 P3 Py — SP) P2P3) + (6P2P3P4 — 4P; P3 Pa) 
= 9P, P3 Ps — 4P, P3 Py — 5P, Pp Ps. 


An element of C1(S) is of the form 
my, P; Po + m2 P, Ps + m3P) Py + m4 P2P3 + ms Py Pa + me P3 Pe, 
and an element of Co(S) is of the form 
m,P; +m2P, +m3P3+m4FP4. A 


Now if o is an n-simplex, 0,(a) € C,_-1(X) forn = 1, 2,3. Let us define C_,(X) = 
{0}, the trivial group of one element, and then we will also have dg(a) € C_1(X). Since 
C,,(X) is free abelian, and since we can specify a homomorphism of such a group by 
giving its values on generators, we see that 0, gives a unique boundary homomorphism, 
which we denote again by “d,,” mapping C,,(X) into C,_i(X) for n = 0, 1, 2, 3. 


41.5 Example 


41.6 Example 


41.7 Example 


41.8 Example 
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We have 


On ( Sma = Y > midn(@i). 
For example, 


84(3 Pi Py — 4 Py Ps + 5 Po Pa) = 30)( Py Px) — 401 (P P3) + 501 (Po Pa) 
= 3(P, — P|) — 4(P3 — Pi) + 5(P4 — Po) 
= P| —2P) —4P3+5P4. A 


The student is reminded again that any time you have a homomorphism, two things 
are of great interest, the kernel and the image. The kernel of 9, consists of those 
n-chains with boundary 0. The elements of the kernel are n-cycles. The usual notation 
for the kernel of 0,, that is, the group of n-cycles, is “Z,(X).” 

Ifz = P,P. + Po P3+ P3P,, then 
1(z) = (P, — Pi) + (P3 — Po) + (Pi — Ps) = 0. 
Thus z is a 1-cycle. However, if we let c = P, P) + 2P)P3 + P3Py, then 
81(c) = (Py — Pi) + 2(P3 — Po) + (Pi — P3) = —P2 + P3 #0. 
Thus c ¢ Z;(X). A 


Note that z = P; Po + Po P3 + P;P; of Example 41.6 corresponds to one circuit, or 
cycle, around a triangle with vertices P;, P), and P3. 

The image under 0, the group of (7 — 1)-boundaries, consists exactly of those 
(n — 1)-chains that are boundaries of n-chains. This group is denoted by “B,_1(X).” 


Referring to Example 41.6, we see that if 
P| P3 + 2P,P3+ P3P, 


is a l-chain in C\(X), then P3; — P, is a 0-boundary. Note that P3 — P2 bounds P»P3. 
A 


Let us now compute Z,,(X) and B,,(X) for a more complicated example. In topology, 
if a group is the trivial group consisting just of the identity 0, one usually denotes it by 
“Q” rather than “{0}.? We shall follow this convention. 


Let us compute for n = 0, 1,2 the groups Z,(S) and B,(S) for the surface S of the 
tetrahedron of Fig. 41.2. 

First, for the easier cases, since the highest dimensional simplex for the surface is a 
2-simplex, we have C3(S) = 0, so 


Bo(S) = 03[C3(S)] = 0. 
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Also, since C_;(S) = 0 by our definition, we see that 
Zo(S) = Co(S). 


Thus Zo(S) is free abelian on four generators, P|, P2, P3, and P4. It is easily seen that 
the image of a group under a homomorphism is generated by the images of generators 
of the original group. Thus, since C)($) is generated by P| P2, P| P3, P| Pa, P2P3, P2 Ps, 
and P; P4, we see that By(S) is generated by 


Py — P,, P3— Pi, Pa — Pi, P3 — Po, Py — Po, Pa — P3. 


However, Bo(S) is not free abelian on these generators. For example, P3; — P, = (P3 — 
P|) — (P; — P,). It is easy to see that Bo(S) is free abelian on P, — P,, P3 — P, and 
P A P Il: 

Now let us go after the tougher group Z,(S). An element c of C,(S) is a formal sum 
of integral multiples of edges P; P;. It is clear that d;(c) = 0 if and only if each vertex 
that is the beginning point of a total (counting multiplicity) of r edges of c is also the 
end point of exactly r edges. Thus 


Zy = P)P3+ P3Ps+ P4Po, 
Z2 = Pj Pat P4P3 + P3P,, 
23 = Py P, + PoP t PsP, 
Z4 = Py P3 + P3P)+ PP, 


are all l-cycles. These are exactly the boundaries of the individual 2-simplexes. We 
claim that the z; generate Z;(S). Let z € Z,(S), and choose a particular vertex, say P,; 


«let us work on edges having P, as an end point. These edges are P, P;, P; P3, and P, Py. 


Let the coefficient of P, P; in z be m,;. Then 
Z+M2Z4 — M422 


is again a cycle, but does not contain the edges P,P, or P; Py. Thus the only edge 
having P, as a vertex in the cycle z + m2z4 — m4Z2 is possibly P, P3, but this edge 
could not appear with a nonzero coefficient as it would contribute a nonzero multiple of 
the vertex P, to the boundary, contradicting the fact that a cycle has boundary 0. Thus 
Z+m2Z4 — mM4Z? consists of the edges of the 2-simplex P2P3 P,. Since in a 1-cycle each 
of P, P3, and P4 must serve the same number of times as a beginning and an end point 
of edges in the cycle, counting multiplicity, we see that 


Z+M32Z4 — M422 = 721 


for some integer r. Thus Z;(S) is generated by the z;, actually by any three of the z;. 
Since the z; are the individual boundaries of the 2-simplexes, as we observed, we see 
that 


Z\(S) = Bi(S). 


The student should see geometrically what this computation means in terms of Fig. 41.2. 

Finally, we describe Z2(S). Now C2(S) is generated by the simplexes P2P3 Ps, 
P3P, P4, P| Ps P4, and P,P; P3. If P> P; Ps has coefficient r; and P3; P; Ps has coefficient 
rz in a 2-cycle, then the common edge P3P, has coefficient r; — 2 in its boundary. 


41.9 Theorem 


Proof 


41.10 Corollary 


Proof 


41.11 Definition 


41.12 Example 
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Thus we must have r; = r2, and by a similar argument, in a cycle each one of the four 
2-simplexes appears with the same coefficient. Thus Z2(S) is generated by 


P)P; Py + P3P,P4+ P,P, Py + PoP; Ps, 


that is, Z2(S) is infinite cyclic. Again, the student should interpret this computation geo- 
metrically in terms of Fig. 41.2. Note that the orientation of each summand corresponds 
to going around that triangle clockwise, when viewed from the outside of the tetrahedron. 

A 


&* = 0 and Homology Groups 


We now come to one of the most important equations in all of mathematics. We shall 
stale it only for n = 1, 2, and 3, but it holds for all n > 0. 


Let X be a simplicial complex, and let C,,(X) be the n-chains of X form = 0, 1,2,3. Then 
the composite homomorphism @,_ 10, mapping C,,(X) into C,,_2(X) maps everything 
into 0 form = 1, 2, 3. That is, for each c € C,(X) we have 0,_1(d,(c)) = 0. We use the 
notation “0,10, = 0,” or, more briefly, “a? = 0.” 


Since a homomorphism is completely determined by its values on generators, itis enough 
to check that for an n-simplex o, we have 0,-1(4,(c)) = 0. For n = 1 this is obvious, 
since d9 maps everything into 0. For n = 2, 


1 (02(P} P2 P3)) = 0,(P2P3 — P, P3 + P; P2) 
= (P3 — P2)— (P3 — Pi) + (P2 — Pi) 
=0. 


The case n = 3 will make an excellent exercise for the student in the definition of the 
boundary operator (see Exercise 2). ¢ 


For n = 0, 1, 2, and 3, B,(X) is a subgroup of Z,(X). 


Forn = 0, 1, and 2, we have B,(X) = 4,41[C,41(X)]. Then if b € B,(X), we must have 
b = dn41(c) for some c € C,,4,(X). Thus 


On(b) = On(On41(e)) = 0, 


sob € Z,(X). 
For n = 3, since we are not concerned with simplexes of dimension greater thar. 3. 
B3(X) = 0. ° 


The factor group H,(X) = Z,(X)/B,(X) is the n-dimensional homology group of X. 
a 


Letus calculate H,(S)forn = 0, 1, 2, and 3 and where S is the surface of the tetrahedron 
in Fig. 41.2. 

We found Z,,(S) and B,(S) in Example 41.8. Now C3(S) = 0, so Z3(S) and B3(S) 
are both 0, and hence 


A3(S) = 0. 
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Also, Z2(S) is infinite cyclic and we saw that Bo.(S) = 0. Thus H2(S) is infinite cyclic, 
that is, 


We saw that Z;(S) = Bi(S), so the factor group Z,($)/B,(S) is the trivial group of one 
element, that is, 
A, (S) = 0. 


Finally, Zo(S) was free abelian on P,, P2, P3, and Py, while Bo(S) was generated by 
Py, — Pi, P3 — Py, Pa — Pi, P3 — Po, Py — Po, and P4 — P3. We claim that every coset 
of Zo(S)/ Bo(S) contains exactly one element of the formr P;.Let z € Zp(S), and suppose 
that the coefficient of P2 in z is s2, of P3 is 53, and of P, is sy. Then 


Z— [soCP2 — P,) +.83(P3 — Pi) + sa(Py —- Py) =r Py 


for some r, so z € [r P, + Bo(S)], that is, any coset does contain an element of the form 
r P,. If the coset also contains r’ P,, then r’P, € [r P; + Bo(S)], so (r’ — r)P, is in Bo(S). 
Clearly, the only multiple of P; that is a boundary is zero, sor = r’ and the coset contains 
exactly one element of the form r P,. We may then choose the r P| as representatives of 
the cosets in computing Hp(S). Thus Ho(S) is infinite cyclic, that is, 


Ay(S) = Z. A 


These definitions and computations probably seem very complicated to the student. 
The ideas are very natural, but we admit that they are a bit messy to write down. However, 


the arguments used in these calculations are typical for homology theory, i-e., if you can 
understand them, you will understand all our others. Furthermore, we can make them 


geometrically, looking at the picture of the space. The next section will be devoted to 
further computations of homology groups of certain simple but important spaces. 


4] 


1. Assume that c = 2 P, P3 Py — 4P3 Ps Ps + 3P3P2P;+ P, Po Py is a 2-chain of a certain simplicial complex X. 
a. Compute 02(c). b. Is c a 2-cycle? c. Is d2(c) a 1-cycle? 

2. Compute 0)(03(P; P2 P; P1)) and show that it is 0, completing the proof of Theorem 41.9. 

3. Describe C;(P), Z;(P), B;(P), and H;(P) for the space consisting of just the 0-simplex P. (This is really a 


trivial problem.) 


4. Describe C;(X), Z;(X), B;(X), and H;(X) for the space X consisting of two distinct 0-simplexes, P and P’. 
(Note: The line segment joining the two points is not part of the space.) 


5. Describe C;(X), Z;(X), B;(X), and H;(X) for the space X consisting of the 1-simplex P) P>. 
6. Mark each of the following true or false. 


a. Every boundary is a cycle. 
b. Every cycle is a boundary. 


c. C,,(X) is always a free abelian group. 
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d. B,(X) is always a free abelian group. 


e. Z,(X) is always a free abelian group. 


f. H,,(X) is always abelian. 

g. The boundary of a 3-simplex is a 2-simplex. 

h. The boundary of a 2-simplex is a 1-chain. 

i, The boundary of a 3-cycle is a 2-chain. 

j. If Z,(X) = B,(X), then H,,(X) is the trivial group of one element. 


More Exercises 


7. 


10. 


11. 


12. 


13. 


Define the following concepts so as to generalize naturally the definitions in the text given for dimensions 0, 
1, 2, and 3. 

a. An oriented n-simplex 

b. The boundary of an oriented n-simplex 

c. A face of an oriented n-simplex 


. Continuing the idea of Exercise 7, what would be an easy way to answer a question asking you to define 


C,(X), 8, 1 C,(X) > C,-1(X), Z,(X), and B,(X) for a simplicial complex X perhaps containing some sim- 
plexes of dimension greater than 3? 


. Following the ideas of Exercises 7 and 8, prove that 8? = 0 in general, i.e., that 3,_;(@,(c)) = 0 for every 


c € C,(X), where n may be greater than 3. 


Let X be a simplicial complex. For an (oriented) n-simplex o of X, the coboundary 8)() of o is the (n + 1)- 
chain )> t, where the sum is taken over all (x + 1)-simplexes t that have o as a face. That is, the simplexes 
T appearing in the sum are precisely those that have o as a summand of 0,4(t). Orientation is important 
here. Thus P> is a face of P, P2, but P, is not. However, P; is a face of P, P;. Let X be the simplicial complex 
consisting of the solid tetrahedron of Fig. 41.2. 

a. Compute 5(P,) and 8 (Py). 

b. Compute 5'?(P; P2). 

c. Compute 5'2( P3 Py Py). 

Following the idea of Exercise 10, let X be a simplicial complex, and let the group C“(X) of n-cochains be 
the same as the group C,,(X). 

a. Define 6% : C(X) > C"’T)(X) in a way analogous to the way we defined 9, : C,(X) > Cy-1(X). 

b. Show that 5* = 0, that is, that &"+(8(c)) = 0 for each c € C™(X). 

Following the ideas of Exercises 10 and 11, define the group Z(X) of n-cocycles of X, the group B (X) of 
n-coboundaries of X, and show that BY (X) < Z(X). 


Following the ideas of Exercises 10, 11, and 12, define the n-dimensional cohomology group H(X) of X. 
Compute H(S) for the surface S of the tetrahedron of Fig, 41.2. 


Dee 


COMPUTATIONS OF HOMOLOGY GROUPS 


Triangulations 


Suppose you wish to calculate homology groups for the surface of a sphere. The first thing 
you probably will say, if you are alert, is that the surface of a sphere is not a simplicial 
complex, since this surface is curved and a triangle is a plane surface. Remember that 
two spaces are topologically the same if one can be obtained from the other by bending, 
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twisting, and so on. Imagine our 3-simplex, the tetrahedron, to have a rubber surface 
and to be filled with air. If the rubber surface is flexible, like the rubber of a balloon, 
it will promptly deform itself into a sphere and the four faces of the tetrahedron will 
then appear as “triangles” drawn on the surface of the sphere. This illustrates what is 
meant by a triangulation of a space. The term triangulation need not refer to a division 
into 2-simplexes only, but is also used for a division into n-simplexes for any n > 0. If 
a space is divided up into pieces in such a way that near each point the space can be 
deformed to look like a part of some Euclidean space R” and the pieces into which the 
space was divided appear after this deformation as part of a simplicial complex, then the 
original division of the space is a triangulation of the space. The homology groups of 
the space are then defined formally just as in the last section. 


Invariance Properties 


There are two very important invariance properties of homology groups, the proofs of 
which require quite a lot of machinery, but that are easy for us to explain roughly. First, 
the homology groups of a space are defined in terms of a triangulation, but actually 
they are the same (i.e., isomorphic) groups no matter how the space is triangulated. For 
example, a square region can be triangulated in many ways, two of which are shown in 
Fig. 42.1. The homology groups are the same no matter which triangulation is used to 
compute them. This is not obvious! 


42.1 Figure 


For the second invariance property, if one triangulated space is homeomorphic to 
another (e.g., can be deformed into the other without being torn or cut), the homology 
groups of the two spaces are the same (i.¢., isomorphic) in each dimension n. This is, 
again, not obvious. We shall use both of these facts without proof. 


The homology groups of the surface of a sphere are the same as those for the surface of 
our tetrahedron in Example 41.12, since the two spaces are homeomorphic. A 


Two important types of spaces in topology are the spheres and the cells. Let us 
introduce them and the usual notations. The m-sphere S” is the set of all points a distance 
of | unit from the origin in (x + 1)-dimensional Euclidean space R*!. Thus the 2-sphere 
S? is what is usually called the surface of a sphere in R?, S! is the rim of a circle, and S° 
is two points. Of course, the choice of 1 for the distance from the origin is not important. 
A 2-sphere of radius 10 is homeomorphic to one of radius 1 and homeomorphic to the 
surface of an ellipsoid for that matter. The -cell or n-ball E” is the set of all points 
in R” a distance < 1 from the origin. Thus E? is what you usually think of as a solid 
sphere, E? is a circular region, and E! is a line segment. 


42.3 Example 


42.4 Theorem 


Proof 


42.5 Example 
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The above remarks and the computations of Example 41.12 show that H2(S*) and Hy(S 4 
are both isomorphic to Z, and H,(S”) = 0. A 


Connected and Contractible Spaces 


There is a very nice interpretation of H)(X) for a space X with a triangulation. A space 
is connected if any two points in it can be joined by a path (a concept that we will not 
define) lying totally in the space. If a space is not connected, then it is split up into a 
number of pieces, each of which is connected but no two of which can be joined by a 
path in the space. These pieces are the connected components of the space. 


If a space X is triangulated into a finite number of simplexes, then Ho(X) is isomorphic 
toZxZx---~x Z, and the Betti number m of factors Z is the number of connected 
components of X. 


Now Co(X) is the free abelian group generated by the finite number of vertices P; in the 
triangulation of X. Also, Bg(X) is generated by expressions of the form 
Pi, — P; 


1? 


where P;, P;, is an edge in the triangulation. Fix P;,. Any vertex P;, in the same connected 
component of X as P;, can be joined to P,, by a finite sequence 


Pi, Pigs Piz Pigs 00) Pig Pi, 
of edges. Then 
P, = Pi, + (PF, — Pi) + (Pi, — Pin) +++ + CPi, — Pi), 


showing that P;, € [P;, + Bo(X)]. Clearly, if P;, is not in the same connected component 
with P;,, then P;, ¢ [P;, + Bo(X)], since no edge joins the two components. Thus, if we 
select one vertex from each connected component, each coset of Hp(X) contains exactly 
one representative that is an integral multiple of one of the selected vertices. The theorem 
follows at once. ¢ 


We have at once that 
Ao(S") ~ Z 
for n > 0, since S” is connected for n > 0. However, 
H(S°)~ZxZ 
(see Chapter 41, Exercise 4). Also, 
AoE") =Z@ 
forn > 1. A 


A space is contractible if it can be compressed to a point without being torn or cut, 
but always kept within the space it originally occupied. We just state the next theorem. 
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If X is acontractible space triangulated into a finite number of simplexes, then H,,(X) = 0 
forn > 1. 


It is a fact that S? is not contractible. It is not too easy to prove this fact. The student 
will, however, probably be willing to take it as self-evident that you cannot compress the 
“surface of a sphere” to a point without tearing it, keeping it always within the original 
space S* that it occupied. It is not fair to compress it all to the “center of the sphere.” 
We saw that H2(S*) 4 0 but is isomorphic to Z. 

Suppose, however, we consider H2(E*), where we can regard E? as our solid tetra- 
hedron of Fig. 41.2, for it is homomorphic to E>, The surface S of this tetrahedron 
is homomorphic to S?. The simplexes here for #7 are the same as they are for S (or 
S?), which we examined in Examples 41.8 and 41.12, except for the whole 3-simplex 
o that now appears. Remember that a generator of Z2(S), and hence of Z(E*), was 
exactly the entire boundary of a. Viewed in E°, this is 43(0), an element of B2(E°), so 
Z3(E*) = Bo(E*) and H2(E*) = 0. Since E? is obviously contractible, this is consistent 
with Theorem 42.6. A 


In general, E” is contractible for n > 1, so we have by Theorem 42.6, 
A,(E") = 0 


fori > 0. 


Further Computations 


s We have seen anice interpretation for Ho(X) im Theorem 42.4. As the preceding examples 
illustrate, the 1-cycles in a triangulated space are generated by closed curves of the space 
formed by edges of the triangulation. The 2-cycles can be thought of as generated by 
2-spheres or other closed 2-dimensional surfaces in the space. Forming the factor group 


F(X) = Z1(X)/Bi(X) 


amounts roughly to counting the closed curves that appear in the space that are not there 
simply because they appear as the boundary of a 2-dimensional piece (i.e., a collection of 
2-simplexes) of the space. Similarly, forming H)(X) = Z2(X)/B2(X) amounts roughly 
to counting the closed 2-dimensional surfaces in the space that cannot be “filled in solid” 
within the space, i.e., are not boundaries of some collection of 3-simplexes. Thus for 
H,(S*), every closed curve drawn on the surface of the 2-sphere bounds a 2-dimensional 
piece of the sphere, so H,(S?) = 0. However, the only possible closed 2-dimensional 
surface, S? itself, cannot be “filled in solid” within the whole space S* itself, so H(S?) 
is free abelian on one generator. 


According to the reasoning above, one would expect H,(S') to be free abelian on one gen- 
erator, i.e., isomorphic to Z, since the circle itself is not the boundary of a 2-dimensional 
part of S'. You see, there is no 2-dimensional part of S'. We compute and see whether 
this is indeed so. 

A triangulation of S! is given in Fig. 42.9. Now C(S!) is generated by P, P2, P: P3, 
and P3P;. Ifa 1-chain is a cycle so that its boundary is zero, then it must contain P, P> 


42.10 Example 


Section 42 Computations of Homology Groups 367 


42.9 Figure 


and P,P; the same number of times, otherwise its boundary would contain a nonzero 
multiple of P:. A similar argument holds for any two edges. Thus Z,;(S') is generated 
by P,P. + PoP; + P3P;. Since Bi (S!) = 43[C2(S!)] = 0, there being no 2-simplexes, 
we see that H,(S!) is free abelian on one generator, that is, 


Hy (S') ~ Z. A 


It can be proved that for n > 0, H,(S") and Ho(S") are isomorphic to Z, while 
A,(S") =O for0<i<n. 

To conform to topological terminology, we shall call an element of H,,(X), that is, 
a coset of B,(X) in Z,(X), a “homology class.” Cycles in the same homology class are 
homologous. 


Let us‘compute the homology groups of a plane annular region X between two concentric 
circles. A triangulation is indicated in Fig. 42.11. Of course, since X is connected, it 
follows that 


A(X) ~ Z. 


42.11 Figure 
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If zis any |-cycle, and if P, P2 has coefficient r in z, then z — r42(P, P;Q,) isacycle 
without P; P, homologous to z. By continuing this argument, we find that there is a 1-cycle 
homologous to z containing no edge on the inner circle of the annulus. Using the “outside” 
triangles, we can adjust further by multiples of 02(Q; P; Q ;), and we arrive at z’ containing 
no edge Q; P; either. But then if Qs P; appears in z’ with nonzero coefficient, P, appears 
with nonzero coefficient in 0)(z’), contradicting the fact that z’ is a cycle. Similarly, no 
edge QO; P;; can occur fori = 1, 2,3, 4. Thus z is homologous to a cycle made up of 
edges only on the outer circle. By a familiar argument, such a cycle must be of the form 


A(Q1 Qo + Q203 + O304 + O405 + O51). 
It is then clear that 
Ay(X) = Z. 


We showed that we could “push” any I-cycle to the outer circle. Of course, we could 
have pushed it to the inner circle equally well. 

For H)(X), note that Z2(X) = 0, since every 2-simplex has in its boundary an edge 
on either the inner or the outer circle of the annulus that appears in no other 2-simplex. 
The boundary of any nonzero 2-chain must then contain some nonzero multiples of these 
edges. Hence 

Hy(X) = 0. = 
We shall compute the homology groups of the torus surface X which looks like the 
surface of a doughnut, as in Fig. 42.13. To visualize a triangulation of the torus, imagine 
that you cut it on the circle marked a, then cut it all around the circle marked b, and flatten 
it out as in Fig. 42.14. Then draw the triangles. To recover the torus from Fig. 42.14, join 
the left edge b to the right edge b in such a way that the arrows are going in the same 
direction. This gives a cylinder with circle a at each end. Then bend the cylinder around 
and join the circles a, again keeping the arrows going the same way around the circles. 

Since the torus is connected, H)(X) ~ Z. 

For H)(X), let z be a 1-cycle. By changing z by a multiple of the boundary of 
the triangle numbered 1 in Fig. 42.14, you can get a homologous cycle not containing 
the side / of triangle 1. Then by changing this new 1-cycle by a suitable multiple of the 
boundary of triangle 2, you can further eliminate the side | of 2. Continuing, we can then 


42.13 Figure 


42.16 Figure 
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a 


42.14 Figure 42.15 Figure 


eliminate / of 3, | of 4, / of 5, — of 6, / of 7, | of 8, / of 9, | of 10, / of 11, — of 12,/ of 
13, | of 14,/ of 15, | of 16, and / of 17. The resulting cycle, homologous to z, can. then 
only contain the edges shown in Fig. 42.15. But such a cycle could not contain, with 
nonzero coefficient, any of the edges we have numbered in Fig. 42.15, or it would not 
have boundary 0. Thus z is homologous to a 1-cycle having edges only on the circle a 
or the circle b (refer to Fig. 42.13). By a now hopefully familiar argument, every edge 
on circle a must appear the same number of times, and the same is also true for edges 
on circle b; however, an edge on circle b need not appear the same number of times as 
an edge appears on a. Furthermore, if a 2-chain is to have a boundary just containing a 
and p, all the triangles oriented counterclockwise must appear with the same coefficient 
so that the inner edges will cancel out. The boundary of such a 2-chain is 0. Thus every 
homology class (coset) contains one and only one element 


ra+sb, 


where r and s are integers. Hence H,(X) is free abelian on two generators, represented 
by the two circles a and b. Therefore, 


A(X) ~Z x Z. 


Finally, for H2(X), a 2-cycle must contain the triangle numbered 2 of Fig. 42.14 
with counterclockwise orientation the same number of times as it contains the triangle 
numbered 3, also with counterclockwise orientation, in order for the common edge / of 
these triangles not to be in the boundary. These orientations are illustrated in Fig. 42.16. 
The same holds true for any two adjacent triangles, and thus every triangle with the coun- 
terclockwise orientation must appear the same number of times in a 2-cycle. Clearly, any 
rultiple of the formal sum of all the 2-simplexes, all with counterclockwise orientation, 
is a 2-cycle. Thus Z2(X) is infinite cyclic, isomorphic to Z. Also, B2(X) = 0, there being 
no 3-simplexes, so 


1 


Hy(X) = Z. A 
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@ EXERCISES 42 
In these exercises, you need not write out in detail your computations or arguments. 


Computations 


1. Compute the homology groups of the space consisting of two tangent 1-spheres, i.e., a figure eight. 

2. Compute the homology groups of the space consisting of two tangent 2-spheres. 

3. Compute the homology groups of the space consisting of a 2-sphere with an annular ring (as in Fig. 42.11) that 
does not touch the 2-sphere. 

4. Compute the homology groups of the space consisting of a 2-sphere with an annular ring whose inner circle is 
a great circle of the 2-sphere. 

5. Compute the homology groups of the space consisting of a circle touching a 2-sphere at one point. 

6. Compute the homology groups of the surface consisting of a 2-sphere with a handle (see Fig. 42.17). 


7. Mark each of the following true or false. 


a. Homeomorphic simplicial complexes have isomorphic homology groups. 

b. Iftwo simplicial complexes have isomorphic homology groups, then the spaces are homeomorphic. 
c. S$” is homeomorphic to E”. 

d. H,(X) is trivial for n > Oif X is a connected space with a finite triangulation. 

e. H,(X) is trivial forn > 0 if X is a contractible space with a finite triangulation. 

f. H,(S”) = 0 forn > 0. 
g. 
h. 
iL 
j. 


H,(E") = 0 forn > 0. 

A torus is homeomorphic to 5?. 

A torus is homeomorphic to E’. 

A torus is homeomorphic to a sphere with a handle on it (see Fig. 42.17). 


8. Compute the homology groups of the space consisting of two torus surfaces having no points in common. 


42.17 Figure 42.18 Figure 


11. 
12. 


Section 43. More Homology Computations and Applications 371 


. Compute the homology groups of the space consisting of two stacked torus surfaces, stacked as one would 


stack two inner tubes. 


. Compute the homology groups of the space consisting of a torus tangent to a 2-sphere at all points of a great 


circle of the 2-sphere, i.e., a balloon wearing an inner tube. 


Compute the homology groups of the surface consisting of a 2-sphere with two handles (see Fig. 42.18). 


Compute the homology groups of the surface consisting of a 2-sphere with n handles (generalizing Exercises 
6 and 11. 


43.1 Example 


Moret HoMoLoGy COMPUTATIONS AND APPLICATIONS 


One-Sided Surfaces 


Thus far all the homology groups we have found have been free abelian, so that there 
were no nonzero elements of finite order. This can be shown always to be the case for the 
homology groups of a closed surface (i.e., a surface like S*, which has no boundary) that 
has two sides. Our next example is of a one-sided closed surface, the Klein bottle. Here 
the 1-dimensional homology group will have a nontrivial torsion subgroup reflecting the 
twist in the surface. 


Let us calculate the homology groups of the Klein bottle X. Figure 43.2 represents the 
Klein bottle cut apart, just as Fig. 42.14 represents the torus cut apart. The only difference 
is that the arrows on the top and bottom edge a of the rectangle are in opposite directions 
this time. To recover a Klein bottle from Fig. 43.2, again bend the rectangle joining the 
edges fabeled b so that the directions of the arrows match up. This gives a cylinder that 
is shown somewhat deformed, with the bottom end pushed a little way up inside the 
cylinder, in Fig. 43.3. Such deformations are legitimate in topology. Now the circles a 
must be joined so that the arrows go around the same way. This cannot be done in R°. 
You must imagine that you are in R‘, so that you can bend the neck of the bottle around 
and “through” the side without intersecting the side, as shown in Fig. 43.4. With a little 
thought, you can see that this resulting surface really has only one side. That is, if you 


a 


43.2 Figure 


372 Part VHI Groups in Topology 


43.3 Figure 43.4 Figure 


start at any place and begin to paint “one side,” you will wind up painting the whole 
thing. There is no concept of an inside of a Klein bottle. 

We can calculate the homology groups of the Klein bottle much as we calculated 
the homology groups of the torus in Example 42.12, by splitting Fig. 43.2 into triangles 
exactly as we did for the torus. Of course, 


? Ho(X) ~ Z, 


since X is connected. As we found for the torus, if we triangulate the Klein bottle by 
dividing Fig. 43.2 into triangles, every 1-cycle is homologous to a cycle of the form 


ra+sb 


for r and s integers. If a 2-chain is to have a boundary containing just a and b, again, 
all the triangles oriented counterclockwise must appear with the same coefficient so that 
the inner edges will cancel each other. In the case of the torus, the boundary of such a 
2-chain was 0. Here, however, it is k(2a), where k is the number of times each triangle 
appears. Thus 1) (X) is an abelian group with generators the homology classes of a and 
b and the relations a + b = b + a and 2a = 0. Therefore, 


A\(X)~Z x Z, 


a group with torsion coefficient 2 and Betti number 1. Our argument above regarding 
2-chains shows that there are no 2-cycles this time, so 


ih(X) =0. A 


A torsion coefficient does not have to be present in some homology group of a one- 
sided surface with boundary. Mostly for the sake of completeness, we give this standard 
example of the Mébius strip. 


43.5 Example 
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Let X be the M6bius strip, which we can form by taking a rectangle of paper and joining 
the two ends marked a with a half twist so that the arrows match up, as indicated in 
Fig. 43.6. Note that the Mobius strip is a surface with a boundary, and the boundary is 
just one closed curve (homomorphic to a circle) made up of J and /’. It is clear that the 
Mobius strip, like the Klein bottle, has just one side, in the sense that if you were asked 
to color only one side of it, you would wind up coloring the whole thing. 

Of course, since X is connected, 


Let z be any 1-cycle. By subtracting in succession suitable multiples of the triangles 
numbered 2, 3, and 4 in Fig. 43.6, we can eliminate edges/of triangle 2, | of triangle 3. 


and \ of triangle 4. Thus z is homologous to a cycle z’ having edges on only /, J’, and 
a, and as before, both edges on /’ must appear the same number of times. But if c is a 
2-chain consisting of the formal sum of the triangles oriented as shown in Fig. 43.6, we 
see that 02(c) consists of the edges on/ and /’ plus 2a. Since both edges on /' must appear 
in z’ the same number of times, by subtracting a suitable multiple of d2(c), we see that 
z is homologous to a cycle with edges just lying on / and a. By a familiar argument, 
all these edges properly oriented must appear the same number of times in this new 
cycle, and thus the homology class containing their formal sum is a generator for H1(X). 
Therefore, 


A(X) ~Z. 
This generating cycle starts at Q and goes around the strip, then cuts across it at P via 
a, and arrives at its starting point. 
If z” were a 2-cycle, it would have to contain the triangles 1, 2, 3, and 4 of Fig. 


43.6 an equal number r of times with the indicated orientation. But then 02(z”) would 
be r(2a +1 +1’) £ 0. Thus Z3(X) = 0, so 


I(X) = 0. A 
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43.6 Figure 


The Euler Characteristic 


Let us turn from the computation of homology groups to a few interesting facts and 
applications. Let X be a finite simplicial complex (or triangulated space) consisting 
of simplexes of dimension 3 and less. Let mo be the total number of vertices in the 
triangulation, n; the number of edges, m2 the number of 2-simplexes, and n3 the number 
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of 3-simplexes. The number 
3 . 
Ag — My +n — 713 = So (din 
i=0 


can be shown to be the same no matter how the space X is triangulated. This number 
is the Euler characteristic x(X) of the space. We just state the following fascinating 
theorem. 


Let X be a finite simplicial complex (or triangulated space) of dimension <3. Let x(X) 
be the Euler characteristic of the space X, and let 8; be the Betti number of H j(X). Then 


x(X) = So (-D/B;. 
j=0 


This theorem holds also for X of dimension greater than 3, with the obvious extension 
of the definition of the Euler characteristic to dimension greater than 3. 


Consider the solid tetrahedron E? of Fig. 41.2. Here np =4, 1; =6, n2 = 4, andn3 = 1, 
so 
(EB) =4-644-1=1. 


Remember that we saw that H3(E°) = A>(E?) = H,(E*) =0 and Ho(E*) ~ Z. Thus 


+ B3 = Po = Py = Oand By = 1, so 


3 
>> (1)! B; = 1 = x(E%). 
j=0 


For the surface S? of the tetrahedron in Fig. 41.2, we have np = 4,n, = 6, no = 4, 
and n3 = 0, so 


x(S°) =4-644=2. 
Also, H3(S*) = Hj(S?) = 0, and H>(S*) and Ho(S?) are both isomorphic to Z. Thus 
Bs = B, = Oand Bo = By = 1, so 

3 


>> (1B; = 2 = x(8?). 


j=0 
Finally, for S! in Fig. 42.9, no = 3, 1, = 3, and n2 =n; =0, so 
x(S') =3-3=0. 
Here H,(S') and Ho(S') are both isomorphic to Z, and H3(S!) = H>(S!) = 0. 
Thus Bp = 6, = 1 and f) = 63 = 0, giving 


3 
> (1 Bj = 0 = x(8)). rN 
j=0 


43.9 Example 
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Mappings of Spaces 


A continuous function f mapping a space X into a space Y gives rise toa homomorphism 
fen Mapping H,,(X) into H,(Y) for n > 0. The demonstration of the existence of this 
homomorphism takes more machinery than we wish to develop here, but let us attempt 
to describe how these homomorphisms can be computed in certain cases. The following 
is true: 


If z € Z,(X), and if f(z), regarded as the result of picking up z and setting it down 
in Y in the naively obvious way, should be an n-cycle in Y, then 


Fen + B,(X)) a F®) + BAY). 


That is, if z represents a homology class in H,(X) and f(z) is an n-cycle in Y, 
then f(z) represents the image homology class under f,, of the homology class 
containing z. 


Let us illustrate this and attempt to show just what we mean here by f(z). 


Consider the unit circle 
S={@,y|et+y=1} 


in R?. Any point in S! has coordinates (cos 6, sin@), as indicated in Fig. 43.10. Let 
f :S! = S! be given by 


F (cos @, sin@)) = (cos 39, sin 36). 
Obvieusly, this function f is continuous. Now f should induce 
fat? Hi(S') > Hi(S"). 


Here H,(S') is isomorphic to Z and has as generator the homology class of z = P; P2 + 
P»P3 + P3P,, as seen in Example 42.8. Now if P;, P., and P3 are evenly spaced about 
the circle, then f maps each of the arcs P; P2, P2P3, and P3P; onto the whole perimeter 
of the circle, that is, 


f(P1P2) = f(P2P3) = fCP3Pi) = Pi Pa + PoP3 + P3Pi. 


(cos 6, sin 0) 


Pi 


x 


P3 


| 
| 
| 


43.10 Figure 
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Thus 
Faiz + By(S')) = 3(P Py + PpP3 + Ps Pi) + By(S") 
= 3z + B,(S'), 
that is, f,1 maps a generator of H,(S*) onto three times itself. This obviously reflects 
the fact that f winds S! around itself three times. A 


Example 43.9 illustrates our previous assertion that the homomorphisms of homo- 
logy groups associated with a continuous mapping f may mirror important properties of 
the mapping. 

Finally, we use these concepts to indicate a proof of the famous Brouwer Fixed-Point 
Theorem. This theorem states that a continuous map f of £” into itself has a fixed point, 
ie., there is some x € E” such that f(x) =x. Let us see what this means for E’, a 
circular region. Imagine that you have a thin sheet of rubber stretched out on a table to 
form a circular disk. Mark with a pencil the outside boundary of the rubber circle on the 
table. Then stretch, compress, bend, twist, and fold the rubber in any fashion without 
tearing it, but keep it always within the penciled circle on the table. When you finish, 
some point on the rubber will be over exactly the same point on the table at which it first 
started. 


43.11 Figure 


The proof we outline is good for any n > 1. For n = 1, looking at the graph of 
a function f : E' > E', we find that the theorem simply states that any continuous 
path joming the left and right sides of a square must cross the diagonal somewhere, 
as indicated in Fig. 43.11. The student should visualize the construction of our proof 
with E* having boundary S? and E? having boundary S!. The proof contains a figure 
illustrating the construction for the case of E?. 


(Brouwer Fixed-Point Theorem). A continuous map f of E” into itself has a fixed point 
forn > 1. 


The case n = 1 was considered above. Let f be a map of E” into E” forn > 1. We shall 
assume that f has no fixed point and shall derive a contradiction. 


@ EXERCISES 


Suggested Exercises 
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If f(x) A x for all x € E”, we can consider the line segment from f(x) to x. Let 
us extend this line segment in the direction from f(x) to x until it goes through the 
boundary $”~! of E” at some point y. This defines for us a function g : E” + S*—! with 
g(x) = y, as illustrated in Fig. 43.13. Note that for y on the boundary, we have g(y) = y. 
Now since f is continuous, it is pretty obvious that g is also continuous. (A continuous 
function is roughly one that maps points that are sufficiently close together into points 
that are close together. If x; and x2 are sufficiently close together, then f(x) and f(x2) 
are sufficiently close together so that the line segment joining f(x;) and x; is so close 
to the line segment joining f (x2) and x2 that y, = g(x ,) is close to yz = g(x2).) Then g 
is a continuous mapping of E” into S"~', and thus induces a homomorphism 


8x(n—1) * A, -\(E") = Hy, \(S"74). 


Now we said that H,,(£") = 0, forn > 1, since E” is contractible, and we checked it 
forn = 2andn = 3. Since g,(,—1) is ahomomorphism, we must have g4¢,-1(0) = 0. But 
an (n — 1)-cycle representing the homology class 0 of H,-1(£”) is the whole complex 
S"—! with proper orientation of simplexes, and g(S"~!) = S"—1, since g(y) = y for all 
y € S*-!, Thus 


2en—1 (0) = 8”! + B,_1(S" 1). 
which is a generator 4 0 of H,_1(S"~"), a contradiction. ¢ 


We find the preceding proof very satisfying aesthetically, and hope you agree. 
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1. Verify by direct calculation that both triangulations of the square region X in Fig. 42.1 give the same value for 
the Euler characteristic y(X). 


2. Illustrate Theorem 43.7, as we did in Example 43.8, for cach of the following spaces. 


a. The annular region of Example 42.10 
b. The torus of Example 42.12 
c, The Klein bottle of Example 43.1 
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3. Will every continuous map of a square region into itself have a fixed point? Why or why not? Will every 
continuous map of a space consisting of two disjoint 2-cells into itself have a fixed point? Why or why not? 


4. Compute the homology groups of the space consisting of a 2-sphere touching a Klein bottle at one point. 
5. Compute the homology groups of the space consisting of two Klein bottles with no points in common. 
6. Mark each of the following true or false. 


a. Every homology group of a contractible space is the trivial group of one element. 

b. A continuous map from a simplicial complex X into a simplicial complex Y induces a homomor- 
phism of H,,(X) into H,,(Y). 

c. All homology groups are abelian. 

d. All homology groups are free abelian. 


e. All 0-dimensional homology groups are free abelian. 


f. If a space X has n-simplexes but none of dimension greater than n and H,,(X) 4 0, then H,,(X) is 
free abelian. 


. The boundary of an m-chain is an (n — 1)-chain. 


§ 
h. The boundary of an n-chain is an (7 — 1)-cycle. 


i, The n-boundaries form a subgroup of the n-cycles. 


j. The n-dimensional homology group of a simplicial complex is always a subgroup of the group of 
n-chains. 


More Exercises 
7. Find the Euler characteristic of a 2-sphere with n handles (see Section 42, Exercise 12). 


8. We can form the topological real projective plane X, using Fig. 43.14, by joining the semicircles a so that 
diametrically opposite points come together and the directions of the arrows match up. This cannot be done in 
Euclidean 3-space R>. One must go to R*. Triangulate this space X, starting with the form exhibited in Fig. 
43.14, and compute its homology groups. 


9, The circular disk shown in Fig. 43.14 can be deformed topologically to appear as a 2-sphere with a hole in it, as 
shown in Fig. 43.15. We form the real projective plane from this configuration by sewing up the hole in such a 
way that only diametrically opposite points on the rim of the hole are sewn together. This cannot be done in R?. 

Extending this idea, a 2-sphere with g holes in it, which are then sewn up by bringing together diametrically 
Opposite points on the rims of the holes, gives a 2-sphere with g cross caps. Find the homology groups of 
a 2-sphere with g cross caps. (To see a triangulation, view the space as the disk in Fig. 43.14 but with g — 1 
holes in it to be sewn up as described above. Then triangulate this disk with these holes.) 


a 
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10. 


11. 


12. 
13. 
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Comment: It can be shown that every sufficiently nice closed surface, namely a closed 2-manifold, is 

homeomorphic to a 2-sphere with some number / > 0 of handles if the surface is two sided, and is homeo- 
morphic to a 2-sphere with q > 0 cross caps if it is one sided. The number / or q, as the case may be, is the 
genus of the surface. 
Every point P on a regular torus X can be described by means of two angles @ and ¢, as shown in Fig. 43.16. 
That is, we can associate coordinates (6, @) with P. For each of the mappings f of the torus X onto itself given 
below, describe the induced map f,,, of H,,(X) into H,(X) for n = 0, 1, and 2, by finding the images of the 
generators for H,,(X) described in Example 42.12. Interpret these group homomorphisms geometrically as we 
did in Example 43.9. 


a f:X>X given by f(G@, 6)) = 26, #) 

b f :X 3X given by S(@, 6) = 6, 2¢) 

a f:X ~X given by F(, &)) = (26, 26) 

With reference to Exercise 10, the torus X can be mapped onto its circle b (which is homeomorphic to $!) by 
a variety of maps. For each such map f : X — b given below, describe the homomorphism f,,, : H,(X) > 
H,,(b) for n = 0, 1, and 2, by describing the image of generators of H,(X) as in Exercise 10. 

a. f:X 3b given by FCO, 6) = (6,0) 

b f: Xb given by F (6, 6) = (26,0) 

Repeat Exercise 11, but view the map f asa map of the torus X into itself, inducing maps fy, : H,(X) > H,(X). 


Consider the map f of the Klein bottle in Fig. 43.2 given by mapping a point Q of the rectangle in Fig. 43.2 
onto the point of b directly opposite (closest to) it. Note that b is topologically a 1-sphere. Compute the induced 
maps fxn : H,(X) > H,(b) for n = 0, 1, and 2, by describing images of generators of H,(X). 


HoMOLOGICAL ALGEBRA 


Chain Complexes and Mappings 


The subject of algebraic topology was responsible for a surge in a new direction in 
algebra. You see, if you have a simplicial complex X, then you naturally get chain 
groups C;(X) and maps d;, as indicated in the diagram 


COS Ga 3 mw Gao 
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Groups in Topology 


with 0,_;0, = 0. You then abstract the purely algebraic portion of this situation and 
consider any sequence of abelian groups A; and homomorphisms 6; : Ay > Az_1 such 
that 0,10, = Ofork > 1. So that you do not always have to require k > 1 in d,_; 0, = 0, 
it is convenient to consider ‘doubly infinite” sequences of groups A, for allk € Z. Often, 
A; = 0 fork <0 and k > n in applications. The study of such sequences and maps of 
such sequences is a topic of homological algebra. 


A chain complex (A, 0) is a doubly infinite sequence 
A = {- + ’ Ad, Al, Ao; A_1, A_2, ca ‘} 


of abelian groups A,, together with a collection 0 = {d, |k € Z} of homomorphisms 
such that 0; : Ay — Ag_) and 0,_)0, = 0. | 


As a convenience similar to our notation in group theory, we shall be sloppy and 
let “A” denote the chain complex (A, 0). We can now imitate in a completely algebraic 
setting our constructions and definitions of Section 41. 


If A is a chain complex, then the image under 0; is a subgroup of the kernel of 0,_1. 


Consider 
a ay 
Ag “4 Ap-4 = Ax_2. 


Now 0g-10,% = 0, since A is a chain complex. That is, 3,_1[0;[A,]] = 0. This tells us at 
once that 0,[A,] is contained in the kernel of d,_,, which is what we wished to prove. 
° 


If A is a chain complex, then the kernel Z;,(A) of 0; is the group of k-cycles, and the 
image By(A) = d41[Ag+ i] is the group of k-boundaries. The factor group H,(A) = 
Z;(A)/ B,(A) is the kth homology group of A. t | 


We stated in the last section that for simplicial complexes X and Y, a continuous 
mapping f from X into Y induces a homomorphism of H;(X) into H,(Y). This mapping 
of the homology groups arises in the following way. For suitable triangulations of X and 
Y, the mapping f gives rise to a homomorphism /; of C,(X) into C,(¥), which has the 
important property that it commutes with d;, that is, 


On te = Se-194. 


Let us turn to the purely algebraic situation and see how this induces a map of the 
homology groups. 


(Fundamental Lemma) Let A and A’ with collections 3 and 0’ of homomorphisms 
be chain complexes, and suppose that there is a collection f of homomorphisms f; : 
Ax — Aj as indicated in the diagram 


On42 Cat Ox Ox—4 
© Ags * Ax > Ap_y a ae 


[fer | fe Ife 
9 


Opa) dy k-] 
BA Se AB aN 5 cs 


kt2 gy 
> Arai 


Proof 


44.5 Definition 


44.6 Example 
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Suppose, furthermore, that every square is commutative, that is, 
Fe-1 9% = 9 fr 
for allk. Then f; induces a natural homomorphism f,; : Hy(A) > H;(A’). 


Let z € Z;,(A). Now 


(F(Z) = fe-18x(Z)) = fe-1(0) = 0, 
so f(z) € Z,(A’). Let us attempt to define f,, : H,(A) > H(A’) by 


Far(Z + Be(A)) = fez) + BCA. (1) 


We must first show that f,, is well defined, i.c., independent of our choice of a 
representative of z + B,(A). Suppose that z; € (zg + B,(A)). Then (z; — z) € B,(A), so 
there exists c € Az4, such that z; — z = 044;(c). But then 


Fe(21) — fal2) = falar — 2) = f(r) = 944 (Feri(©)), 


and this last term is an element of 4;., ;[A,.,] = B,(A’). Hence 


F(Z) € (fe(z) + By(A)). 


Thus two representatives of the same coset in H;,(A) = Z;,(A)/B,(A) are mapped into 
representatives of just one coset in H,(A’) = Z;(A')/B,(A’). This shows that fix : 
H(A) > H;(A’) is well defined by equation (1). 

Now we compute f,, by taking f;, of representatives of cosets, and we define the 
group operation of a factor group by applying the group operation of the original group 
to representatives of cosets. It follows at once from the fact that the action of f, on Z;(A) 
is a homomorphism of Z;(A) into Z;,(A’‘) that f,, is a homomorphism of H;(A) into 
A,(A’). 4 


If the collections of maps f, 0, and 0’ have the property, given in Theorem 44.4, that 
the squares are commutative, then f commutes with a. 

After another definition, we shall give a seemingly trivial but very important illus- 
tration of Theorem 44.4. 


A chain complex (A’, 0’) is a subcomplex of a chain complex (A, 3), if, for all k, A, 
is a subgroup of A, and 0;(c) = 6;(c) for every c € Aj, that is, 0, and 3, have the same 
effect on elements of the subgroup Aj, of Ag. L_| 


Let A be a chain complex, and let A’ be a subcomplex of A. Let i be the collection 
of injection mappings i, : Aj, > Ax given by i,(c) = c for c € Aj. It is obvious that i 
commutes with 0. Thus we have induced homomorphisms i,, : H,(A’) - H;,(A). One 
might naturally suspect that i,, must be an isomorphic mapping of H;,(A’) into H;,(A). 
This need not be so! For example, let us consider the 2-sphere S$? as a subcomplex of the 
3-cell E>. This gives rise to in : C2(S?) > Cp(E°) and induces 


iyo 1 Ho(S?) > Hb(E?). 


But we have seen that H>(S*) ~ Z, while H>(E>) = 0. Thus iyo cannot possibly be an 
isomorphic mapping. A 
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Suppose that A’ is a subcomplex of the chain complex A. The topological situation from 
which this arises is the consideration of a simplicial subcomplex Y (in the obvious sense) 
of a simplicial complex X. We can then naturally consider C,(Y) a subgroup of C;,(X), 
just as in the algebraic situation where we have Aj, a subgroup of A,. Clearly, we would 
have 


OICY)] < Ce Y). 


Let us deal now with the algebraic situation and remember that it can be applied to our 
topological situation at any time. 

If A’ is a subcomplex of the chain complex A, we can form the collection A/A’ 
of factor groups A;/Aj;,. We claim that A/A’ again gives rise to a chain complex ina 
natural way, and we must exhibit a collection d of homomorphisms 


Oy : (Ax / Aj) > (Ar-1/Ag_y) 
such that ,_14, = 0. The definition of 4, to attempt is obvious, namely, define 
O(c + Ay) = O(c) + Ay 


for c € Ay. We have to show three things: that & is well defined, that it is a homomor- 
phism, and that 4,_,4 = 0. 

First, to show that 0; is well defined, let c, also be in c + Aj. Then (c1 — c) € Ai, 
80 &(c, — ¢) € Aj_,. Thus 


Ie (C1) € (Oe(C) + Ay_y) 


: also. This shows that 4; is well defined. 


The equation 
Ox((cy + Az) + (C2 + Ay) = Og((Cr + €2) + Aj) 
= Oe +02) + Agy 
= (& (C1) + A (C2) + Ay_y 
= Ig(e1 + Ay) + Se(€2 + Ay) 
shows that 4, is a homomorphism. 
Finally, 
Ix (Bec + Ay)) = Dea (8e(e) + Ay_4) 
= Ip 1(8k(C)) + Ay_g = O+ Ayo, 
SO On—10k = 0. 

The preceding arguments are typical routine computations to the homological alge- 
braist, just as addition and multiplication of integers are routine to you. We gave them in 
great detail. One has to be a little careful to keep track of dimension, i.e., to keep track 
of subscripts. Actually, the expert in homological algebra usually does not write most 
of these indices, but he always knows precisely with which group he is working. We 


gave all the indices so that you could keep track of exactly which groups were under 
consideration. Let us summarize the above work in a theorem. 


44.7 Theorem 


44.8 Definition 


44.9 Example 
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if A’ is a subcomplex of the chain complex A, then the collection A/A’ of factor groups 
A; /A’,, together with the collection 9 of homomorphisms 4; defined by 


B.(c + Aj) = Ae(e) + Ay, 
for c € A;, is a chain complex. 
Since A/A’ is a chain complex, we can then form the homology groups H;(A/A’). 


The homology group H;(A/A’) is the kth relative homology group of A modulo A’. 
| 


In our topological situation where Y is a subcomplex of a simplicial complex X, we 
shall conform to the usual notation of topologists and denote the kth relative homology 
group arising from the subcomplex C(Y) of the chain complex C(X) by “H,(X, Y).” All 
the chains of Y are thus “set equal to 0.” Geometrically, this corresponds to shrinking Y 
to a point. 


Let X be the simplicial complex consisting of the edges (excluding the inside) of the 
triangle in Fig. 44.10, and let Y be the subcomplex consisting of the edge P;P3. We 
have seen that Hi(X) ~ H,(S') ~ Z. Shrinking P> P3 to a point collapses the rim of the 
triangle, as shown in Fig. 44.11. The result is still topologically the same as 5S’. Thus, 
we would expect again to have H\(X, Y) x Z. 

Generators for C;(X) are P; P2, P)P3, and P3P,. Since P;P3 € C;(Y), we see that 
generators of Cy(X)/Ci(Y) are 


PpP2+ CY) and P3P, + C,(¥). 
To find Z\(X, Y) we compute 
d1(n P) Pz + mP3P; + Cy(Y)) = 0)(nP Pr) + 910m P3 Pi) + Co(¥) 
= ACP, — Pi) + m(P1 — Ps) + Col) 
=(m—n)P, + Co(¥), 


since P3, P; € Co(Y). Thus for acycle, we must have m = n, so a generator of Z;(X, Y) 
is (P; P; + P3P,)+ C,(Y). Since B,(X, Y) = 0, we see that indeed 


A(X, Y) =~ Z. 
Since P; + Co(Y) generates Z)(X, Y) and 
0,( P,P, + Ci(Y)) = (Pi — Px) + Co(¥) = Pi + Col), 


we see that Hp(X, Y) = 0. This is characteristic of relative homology groups of dimen- 
sion 0 for connected simplicial complexes. A 


Ps 


P, Py 
44.10 Figure 44.11 Figure 
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Let us consider S! as a subcomplex (the boundary) of E? and compute H)(E?, S'). 
Remember that E? is a circular disk, so S! can be indeed thought of as its boundary (see 
Fig. 44.13). You can demonstrate the shrinking of S! to a point by putting a drawstring 
around the edge of a circular piece of cloth and then drawing the string so that the rim 
of the circle comes in to one point. The resulting space is then a closed bag or S?. Thus, 
while H,(E*) = 0, since E? is a contractible space, we would expect 


H,(E*, S') ~ Z. 
For purposes of computation, we can regard E* topologically as the triangular 
region of Fig. 44.10 and S! as the rim of the triangle. Then C(E”, S!) is generated by 
P; Py P3 + C2(S'), and 


32(P: P2P3 + C2(S")) = d2(P1 P2 P3) + Ci(S") 
= (P,P — Pi Ps + Pi P2) + Ci(S"). 


But (P,P; — P; P3 + P, Pz) € C(S'), so we have 
8p (Pi P,P3 + Cr(S')) = 0. 
Hence P; P) P3 + Co(S') is an element of Z2(E?, S'). Since 
B(E*, S') = 0, 
we sce that 
H)(E”, S') = Z, 


as we expected. A 


The Exact Homology Sequence of a Pair 


We now describe the exact homology sequence of a pair and give an application. We 
shall not carry out all the details of the computations. The computations are routine 
and straightforward. We shall give all the necessary definitions, and shall let the student 
supply the details in the exercises. 


Let A’ be a subcomplex of a chain complex A. Let j be the collection of natural homo- 
morphisms j; : Ay —> (Ax/Aj,). Then 


je-19% = O8 It, 


that is, 7 commutes with 0. 


We leave this easy computation to the exercises (see Exercise 12). 5d 


44.15 Theorem 


Proof 


44.16 Lemma 


Proof 


44.17 Lemma 


Proof 


44.18 Definition 
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The map j; of Lemma 44.14 induces a natural homomorphism 


Jue | H(A) > Hi (A/A’). 


This is immediate from Lemma 44.14 and Theorem 44.4 ¢ 


Let A’ be a subcomplex of the chain complex A. Let h € H,(A/A’). Then fh = 
z+ B,(A/A’) for z € Z,(A/A’), and in turn z = c + Aj, for somec € Ag. (Note that we 
arrive at c from h by two successive choices of representatives.) Now 4;(z) = 0, which 
implies that 0,(c) € A,_,. This, together with 0,.) 0, = 0, gives us dg(c) € Zp_-\(A'). 
Define 


Axe: Hy(A/A') > Hy1(A’) 
by 
Our) = O(c) + Be_1(A’). 


This definition of 0,, looks very complicated. Think of it as follows. Start with an 
element of H,(A/A’). Now such an element is represented by a relative k-cycle modulo 
A’. To say it is a relative k-cycle modulo A’ is to say that its boundary is in Aj_,. 
Since its boundary is in A, _, and is a boundary of something in A,, this boundary 
must be a (k — 1)-cycle in Aj_,. Thus starting with h ¢ H,(A/A’), we have arrived at a 
(k — 1)-cycle representing a homology class in Hy_1(A’). 


The map 0,; : Hy(A/A’) > Hy-1(A’), which we have just defined, is well defined, and 
is ahomomorphism of H,(A/A’) into Hy_(A’). 


We leave this proof to the exercises (see Exercise 13). ¢ 


Let i,, be the map of Example 44.6. We now can construct the following diagram. 


Baked Jack 


oe Fas. 28> ays 2 FAD 


Jak-1 


(A) 2S H(A) 2S mh (A/AD SS |. () 
The groups in diagram (1), together with the given maps, form a chain complex. 


You need only check that a sequence of two consecutive maps always gives 0. We leave 
this for the exercises (see Exercise 14). ¢ 


Since diagram (1) gives a chain complex, we could (horrors!) ask for the homology 
groups of this chain complex. We have been aiming at this question, the answer to which 
is actually quite easy. All the homology groups of this chain complex are 0. You may 
think that such a chain complex is uninteresting. Far from it. Such a chain complex even 
has a special name. 


A sequence of groups A; and homomorphisms 0; forming a chain complex is an exact 
sequence if all the homology groups of the chain complex are 0, that is, if for all k we 
have that the image under 4; is equal to the kernel of 0;_,. 
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Exact sequences are of great importance in topology. We shall give some elementary 
properties of them in the exercises. ay 


The groups and maps of the chain complex in diagram (1) form an exact sequence. 
We leave this proof to the exercises (see Exercise 15). ¢ 


The exact sequence in diagram (1) is the exact homology sequence of the pair (A, A’). 
a 


Let us now give an application of Theorem 44.19 to topology. We have stated without 
proof that H,,(S") ~ Z and H)(S") ~ Z, but that H,(S”) = 0 fork # 0, n. We have also 
stated without proof that H,(£") = 0 for k # 0, since E” is contractible. Let us assume 
the result for E” and now derive from this the result for S”. 

We can view S” as a subcomplex of the simplicial complex E"*!. For example, E”*! 
is topologically equivalent to an (7 + 1)-simplex, and S” is topologically equivalent to 
its boundary. Let us form the exact homology sequence of the pair (E”*!, S$”). We have 


7 Oant) 


Hyai(S") bent Hig) Jantl Hyii(E"*! ; we) 
ee —— 
=0 =0 =Z 


H,,(S") tei H,(E"*!) je. H,(E""1, S$") San, «Det 
—— — NE 


=% =0 =0 
Aas(E"", 5") 2S ast) 2S Bet) 2S, (2) 
el —— ——$— 
=0 =? =6 


‘for 1 <k <n. The fact that E”*! is contractible gives H,(E"*!) = 0 for k > 1. We 
have indicated this on diagram (2). Viewing E”*! as an (n + 1)-simplex and S$” as its 
boundary, we see that C,(E"t!) < C,(S") for k <n. Therefore H,(E"*!, $”) = 0 for 
k <n. We also indicated this on diagram (2). Just as in Example 44.12, one sees that 
Hy.\(E"*', 8") ~ Z, with a generating homology class containing as representative 


P\ Py +++ Pasa + Cy4i(S"). 


For 1 <k <n, the exact sequence in the last row of diagram (2) tells us that 
H,(S") = 0, for from H,(E"+!) = 0, we sce that 
(kernel i...) = Ay(S"). 
But from Hy,4,(E"*!, 8”) = 0, we see that (image 0,44;) = 0. From exactness, (kernel 
ing) = Gmage 0,441), so A, (S") = Oforl <k <n. 
The following chain of reasoning leads to H,,(S”) ~ Z. Refer to diagram (2) above. 
1. Since H,41(E"*!) = 0, we have (image jyn41) = 0. 


2. Hence (kernel 0,,4)) = (image j.n41) = 0 by exactness, that is, d.y41 is an 
isomorphic mapping. 


3. Therefore (image 0,,,4;) ~ Z. 
4. Since H,(E"+') = 0, we have (kernel i,,) = H,(S"). 
5. By exactness, (image 04,41) = (kernel i.,), So H,(CS") & Z. 
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Thus we see that H,,(S”) ~ Z and H,(S") =O for! <k <n. 
Since S” is connected, H)($”) ~ Z. This fact could also be deduced from the exact 


sequence 
Hy(E"*!, ") * Ho(s") => HE") 4S Hye", 8%. 
— |<< —> —<—$ $$ 
=0 ~Z =0 


@ EXERCISES 44 


Suggested Exercises 


1. Let A and B be additive groups, and suppose that the sequence 


0>AS5B 30 
is exact. Show that A ~ B. 
2. Let A, B, and C be additive groups and suppose that the sequence 


0>A5+BS5C 30 


is exact. Show that 


a. j maps B onto C 
b. i is an isomorphism of A into B 
c. C is isomorphic to B/i[A] 
3. Let A, B, C, and D be additive groups and let 
A+B4c4D 
be an exact sequence. Show that the following three conditions are equivalent: 
a. i is onto B 
b. j maps all of B onto 0 
c. k is a one-to-one map 
4. Show that if 
A% p45 cL p43 24% 
is an exact sequence of additive groups, then the following are equivalent: 


a. h and j both map everything onto 0 
b. i is an isomorphism of C onto D 
c. gis onto B and k is one to one 


More Exercises 
5. Theorem 44.4 and Theorem 44.7 are closely connected with Exercise 39 of Section 14. Show the connection. 


6. Ina computation analogous to Examples 44.9 and 44.12 of the text, find the relative homology groups H,,(X, a) 
for the torus X with subcomplex a, as shown in Figs. 42.13 and Fig. 42.14. (Since we can regard these relative 
homology groups as the homology groups of the space obtained from X by shrinking a to a point, these should 
be the homology groups of the pinched torus.) 
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7. 


8. 


9. 


10. 


11. 


12. 
13. 
14. 
15, 


16. 
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For the simplicial complex X and subcomplex a of Exercise 6, form the exact homology sequence of the pair 
(X, a) and verify by direct computation that this sequence is exact. 


Repeat Exercise 6 with X the Klein bottle of Fig. 43.2 and Fig. 43.4. (This should give the homology groups 
of the pinched Klein bottle.) 

For the simplicial complex X and subcomplex a of Exercise 8, form the exact homology sequence of the pair 
(X, a) and verify by direct computation that this sequence is exact. 

Find the relative homology groups H,,(X, Y), where X is the annular region of Fig. 42.11 and Y is the subcomplex 
consisting of the two boundary circles. 

For the simplicial complex X and subcomplex Y of Exercise 10, form the exact homology sequence of the pair 
(X, Y) and verify by direct computation that this sequence is exact. 

Prove Lemma 44.14 

Prove Lemma 44.16 

Prove Lemma 44.17 

Prove Theorem 44.19 by means of the following steps. 

. Show Gmage i,,) C (Kernel j,x). 

. Show (kernel j...) C Gmage i,,). 

. Show (image j..) C (kernel d,;). 

. Show (kernel 0,;.) © (image jx). 

. Show (image 0,;) © (kernel 7,41). 

» Show (kernel i,,_1) © (image @,,). 


>» © © BS BP 


Let (A, d) and (A’, 0’) be chain complexes, and let f and g be collections of homomorphisms f; : Ay > Aj 
and gy: Ay > Ay such that both f and g commute with 3. An algebraic homotopy between f and g isa 
collection D of homomorphisms D;, : A, — Aj,, such that for all c € Az, we have 


filo) — gle) = 8;,,(Dx(c)) + Dy (8 (€)). 


(One abbreviates this condition by f — g = dD + Dd.) Show that if there exists an algebraic homotopy 
between f and g, that is, if f and g are homotopic, then f,; and g,, are the same homomorphism of H;,(A) 
into H;,(A’). 
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Section 47 Gaussian Integers and Multiplicative Norms 


UNIQUE FACTORIZATION DOMAINS 


The integral domain Z is our standard example of an integral domain in which there is 
unique factorization into primes (irreducibles). Section 23 showed that for a field F, F [x] 
is alSo such an integral domain with unique factorization. In order to discuss analogous 
ideas in an arbitrary integral domain, we shall give several definitions, some of which 
are repetitions of earlier ones. It is nice to have them all in one place for reference. 


Let R be a commutative ring with unity and let a, b € R. If there exists c € R such that 
b = ac, then a divides b (or a is a factor of b), denoted by a|b. We read a{b as “a 
does not divide b.” | 


Anelement u of a commutative ring with unity R is a unit of R if u divides 1, that is, if u 
has a multiplicative inverse in R. Two elements a, b € R are associates in R if a = bu, 
where u is a unit in R. 

Exercise 27 asks us to show that this criterion for a and b to be associates is an 
equivalence relation on R. | 


The only units in Z are 1 and —1. Thus the only associates of 26 in Z are 26 and 
—26. A 


A nonzero element p that is not a unit of an integral domain D is an irreducible of D 
if in every factorization p = ab in D has the property that either @ or b is a unit. z= 


Note that an associate of an irreducible p is again an irreducible, for if p = uc for 
a unit u, then any factorization of c provides a factorization of p. 
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@ HIstoricaL NOTE 


he question of unique factorization in an inte- 

gral domain other than the integers was first 
raised in public in connection with the attempted 
proof by Gabriel Lamé (1795-1870) of Fermat’s 
Last Theorem, the conjecture that x” + y” = z” has 
no nontrivial integral solutions for n > 2. It is not 
hard to show that the conjecture is true if it can 
be proved for all odd primes p. At a meeting of 
the Paris Academy on March 1, 1847, Lamé an- 
nounced that he had proved the theorem and pre- 
sented a sketch of the proof. Lamé’s idea was first 
to factor x” + y? over the complex numbers as 


xP 4 y?P = 


(x + y)(x + ay )x + a’y) vert a?-ly) 


where @ is a primitive pth root of unity. He next pro- 
posed to show that if the factors in this expression 
are relatively prime and if x? + y? = z?, then each 
of the p factors must be a pth power. He could then 
demonstrate that this Fermat equation would be true 
for a triple x’, y’, z’, each number smaller than the 
corresponding number in the original triple. This 
would lead to an infinite descending sequence of 
positive integers, an impossibility that would prove 
the theorem. 

After Lamé finished his announcement, how- 
ever, Joseph Liouville (1809-1882) cast serious 
doubts on the purported proof, noting that the con- 
clusion that each of the relatively prime factors 
was a pth power because their product was a pth 
power depended on the result that any integer can 
be uniquely factored into a product of primes. It 


45.5 Definition 


was by no means clear that “integers” of the form 
x + ay had this unique factorization property. Al- 
though Lamé attempted to overcome Liouville’s ob- 
jections, the matter was settled on May 24, when 
Liouville produced a letter from Emst Kummer not- 
ing that in 1844 he had already proved that unique 
factorization failed in the domain Z[a], where a is 
a 23rd root of unity. 

Tt was not until 1994 that Fermat’s Last Theo- 
rem was proved, and by techniques of algebraic 
geometry unknown to Lamé and Kummer. In the 
late 1950s, Yutaka Taniyama and Goro Shimura no- 
ticed a curious relationship between two seemingly 
disparate fields of mathematics, elliptic curves and 
modular forms. A few years after Taniyama’s tragic 
death at age 31, Shimura clarified this idea and 
eventually formulated what became known as the 
Taniyama—Shimura Conjecture. In 1984, Gerhard 
Frey asserted and in 1986 Ken Ribet proved that 
the Taniyama—Shimura Conjecture would imply the 
truth of Fermat’s Last Theorem. But it was finally 
Andrew Wiles of Princeton University who, after 
secretly working on this problem for seven years, 
gave a series of lectures at Cambridge University 
in June 1993 in which he announced a proof of 
enough of the Taniyama—Shimura Conjecture to de- 
tive Fermat’s Last Theorem. Unfortunately, a gap in 
the proof was soon discovered, and Wiles went back 
to work. It took him more than a year, but with the 
assistance of his student Richard Taylor, he finally 
was able to fill the gap. The result was published in 
the Annals of Mathematics in May 1995, and this 
350-year-old problem was now solved. 


An integral domain D is a unique factorization domain (abbreviated UFD) if the 


following conditions are satisfied: 


1. Every element of D that is neither 0 nor a unit can be factored into a product 
of a finite number of irreducibles. 


2. If pi---p, and q,---q, are two factorizations of the same element of D into 
irreducibles, then r = s and the g; can be renumbered so that p; and q; are 


associates. 


45.6 Example 


45.7 Definition 


45.8 Definition 
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Theorem 23.20 shows that for a field F, F[x] is a UFD. Also we know that @ is a CFD: 
we have made frequent use of this fact, although we have never proved ii. For example. 
in Z we have 


24 = (2)(2)(3)(2) = (-2)(—3)(2)2). 


Here 2 and —2 are associates, as are 3 and —3. Thus except for order and associates, the 
irreducible factors in these two factorizations of 24 are the same. A 


Recall that the principal ideal (a) of D consists of all multiples of the element a. 
After just one more definition we can describe what we wish to achieve in this section. 


An integral domain D is a principal ideal domain (abbreviated PID) if every ideal in 
D isa principal ideal. a 


We know that Z is a PID because every ideal is of the form nZ, generated by some 
integer n. Theorem 27.24 shows that if F is a field, then F'[x] is a PID. Our purpose in 
this section is to prove two exceedingly important theorems: 


1. Every PID is a UFD. (Theorem 45.17) 
2. If Dis a UFD, then D[x] is a UFD. (Theorem 45.29) 


The fact that F [x] is a UFD, where F is a field (by Theorem 23.20), illustrates both 
theorems. For by Theorem 27.24, F[x] is a PID. Also, since F has no nonzero elements 
that are not units, F satisfies our definition for a UFD. Thus Theorem 45.29 would give 
another proof that F'[x] is a UFD, except for the fact that we shall actually use Theorem 
23.20 in proving Theorem 45.29. In the following section we shall study properties of a 
certain special class of UFDs, the Euclidean domains. 

Let us proceed to prove the two theorems. 


Every PID Is a UFD 


The steps leading up to Theorem 23.20 and its proof indicate the way for our proof of 
Theorem 45.17. Much of the material will be repetitive. We inefficiently handled the 
special case of F [x] separately in Theorem 23.20, since it was easy and was the only 
case we needed for our field theory in general. 

To prove that an integral domain D is a UFD, it is necessary to show that both 
Conditions 1 and 2 of the definition of a UFD are satisfied. For our special case of 
F [x] in Theorem 23.20, Condition 1 was very easy and resulted from an argument 
that in a factorization of a polynomial of degree > 0 into a product of two nonconstant 
polynomials, the degree of each factor was less than the degree of the original polynomial. 
Thus we couldn’t keep on factoring indefinitely without running into unit factors, that 
is, polynomials of degree 0. For the general case of a PID, it is harder to show that this 
is so. We now turn to this problem. We shall need one more set-theoretic concept. 


If {A; |i € 7} is a collection of sets, then the union U;<,; A; of the sets A; is the set of 
all x such that x € A; for at least onei € I. | 
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Part IX 


45.9 Lemma 


Proof 


45.10 Lemma 


Proof 


45.11 Theorem 


Proof 


Factorization 


Let R be a commutative ring and let N| C No C --- be an ascending chain of ideals N; 
in R. Then N = U;N, is an ideal of R. 


Let a, b € N. Then there are ideals N; and N; in the chain, with a € N, and b € Nj. 
Now either N; C N; or N; © Nj; let us assume that N; C Nj, so both a and b are in 
Nj. This implies that a + b and ab are in Nj, so a + b and ab are in N. Taking a = 0, 
we see that b € N implies —b € N, and 0 € N since 0 € N;. Thus N isa subring of D. 
For a € N and d € D, we must have a € N; for some N;. Then since N; is an ideal, 
da = ad is in Nj. Therefore, da € U;N;,, that is, da € N. Hence N is an ideal. o 


(Ascending Chain Condition for a PID) Let D be a PID. If N; C No C--- is an 
ascending chain of ideals N;, then there exists a positive integer r such that N, = N, for 
all s > r. Equivalently, every strictly ascending chain of ideals (all inclusions proper) in 
a PID is of finite length. We express this by saying that the ascending chain condition 
(ACC) holds for ideals in a PID. 


By Lemma 45.9, we know that V = U;N; is an ideal of D. Now as an ideal in D, which 
is a PID, N = (c) for some c € D. Since N = U;N;, we must have c € N,, for some 
réZ*. Fors > r, we have 


(c) ON, ON, SN = (Cc). 


Thus N, = N, for s >r. 
The equivalence with the ACC is immediate. ° 


¥n what follows, it will be useful to remember that for elements a and b of a domain D, 


(a) © (b) if and only if b divides a, and 
(a) = (b) if and only if a and b are associates. 


For the first property, note that (a) C (b) if and only if a € (b), which is true if and 
only if a = bd for some d € D, so that b divides a. Using this first property, we sce that 
(a) = (b) if and only if a = bc and b = ad for some c,d € D. But then a = adc and 
by canceling, we obtain 1 = dc. Thus d and c are units so a and b are associates. 

We can now prove Condition 1 of the definition of a UFD for an integral domain 
that is a PID. 


Let D be a PID. Every element that is neither 0 nor a unit in D is a product of irreducibles. 


Let a € D, where a is neither 0 nora unit. We first show that a has at least one irreducible 
factor. If a is an irreducible, we are done. If a is not an irreducible, then a = a,b, where 
neither a, nor b; is a unit. Now 


(a) C (at), 


for (a) € (ay) follows from @ = a, b;, and if (a) = (a1), then a and a; would be asso- 


ciates and b, would be a unit, contrary to construction. Continuing this procedure then, 


45.12 Lemma 


Proof 


45.13 Lemma 


Proof 


45.14 Corollary 


Proof 
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starting now with a), we arrive at a strictly ascending chain of ideals 
(a) C (ay) C (a) C-+ 


By the ACC in Lemma 45.10, this chain terminates with some (a,}, and a, must then be 
irreducible. Thus a has an irreducible factor a,. 

By what we have just proved, for an element a that is neither O nor a unit in D, 
either a is irreducible ora = p,c, for p; an irreducible and c; not a unit. By an argument 
similar to the one just made, in the latter case we can conclude that (a) C (c1). If cy is 
not irreducible, then c; = p2c2 for an irreducible py with cz not a unit. Continuing, we 
get a strictly ascending chain of ideals 


(a) C (e1) C (ex) Coe. 


This chain must terminate, by the ACC in Lemma 45.10, with some c, = q, that is an 
irreducible. Then a = pi po ++: Prr- rs 


This completes our demonstration of Condition 1 of the definition of a UFD. Let us 
turn to Condition 2. Our arguments here are parallel to those leading to Theorem 23.20. 
The results we encountcr along the way are of some interest in themselves. 


(Generalization of Theorem 27.25) An ideal (p) in a PID is maximal if and only if 
p is an irreducible. 


Let (p) be a maximal ideal of D, a PID. Suppose that p = ab in D. Then (p) € (a). 
Suppose that (a) = (p). Then a and p would be associates, so b must be a unit. If 
(a) # (p), then we must have (a) = (1) = D, since (p) is maximal. But then a and 1 
are associates, so a is a unit. Thus, if p = ab, either a or b must be a unit. Hence p is an 
irreducible of D. 

Conversely, suppose that p is an irreducible in D. Then if (p) € (a), we must have 
p = ab. Now if ais aunit, then (a) = (1) = D. If a is not a unit, then b must be a unit, 
so there exists u € D such that bu = 1. Then pu = abu = a, so (a) © (p), and we have 
(a) = (p). Thus (p) © (a) implies that either (a) = D or (a) = (p), and (p) ¢ Dor p 
would be a unit. Hence (p) is a maximal ideal. ¢ 


(Generalization of Theorem 27.27) In a PID, if an irreducible p divides ab, then 
either p|a or p[b. 


Let D bea PID and suppose that for an irreducible p in D we have p | ab. Then (ab) € (p). 
Since every maximal ideal in D is a prime ideal by Corollary 27.16, (ab) € (p) implies 
that either a € (p) or b € (p), giving either p|a or p |b. Sa 


If p is an irreducible in a PID and p divides the product a,a2---a, for a; € D, then 
p|4; for at least one i. 


Proof of this corollary is immediate from Lemma 45.13 if we use mathematical induc- 
tion. ° 
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45.15 Definition 


45.16 Example 


45.17 Theorem 


Proof 


Factorization 


A nonzero nonunit element p of an integral domain D is a prime if, for all a, b € D, 
p|ab implies either p|a or p |b. | 


Lemma 45.13 focused our attention on the defining property of a prime. In Exercises 
25 and 26, we ask you to show that a prime in an integral domain is always an irreducible 
and that ina UFD an irreducible is also a prime. Thus the concepts of prime and irreducible 
coincide in a UFD. Example 45.16 will exhibit an integral domain containing some 
irreducibles that are not primes, so the concepts do not coincide in every domain. 


Let F be a field and let D be the subdomain F[x?, xy, y?] of FLx, y]. Then x*, xy, and 
y? are irreducibles in D, but 


(x? )(y*) = @y)ay)@y). 


Since xy divides x*y° but not x? or y?, we see that xy is not a prime. Similar arguments 
show that neither x? nor y? is a prime. A 


The defining property of a prime is precisely what is needed to establish uniqueness 
of factorization, Condition 2 in the definition of a UFD. We now complete the proof of 
Theorem 45.17 by demonstrating the uniqueness of factorization in a PID. 


(Generalization of Theorem 23.20) Every PID is a UFD. 


Theorem 45.11 shows that if D is a PID, then each a € D, where a is neither 0 nor a 
unit, has a factorization 


: a = Pi pr": Pr 
into irreducibles. It remains for us to show uniqueness. Let 
G4=4192°°°Gs 


be another such factorization into irreducibles. Then we have pi |(gig2-+-qs), which 
implies that p, |g; for some j by Corollary 45.14. By changing the order of the q; if 
necessary, we can assume that j = 1 so p;|q1. Then g; = piui, and since p; is an 
irreducible, u; is a unit, so p; and q are associates. We have then 


Pip2-++> Pr = Pi¥ig2- Gs, 
so by the cancellation law in D, 
P2++* Pr = U142°°° 4s. 
Continuing this process, starting with p2 and so on, we finally arrive at 
1 = uyua--- Up Grt1 Gs 
Since the q; are ireducibles, we must have rr = s. . 4 


Example 45.31 at the end of this section will show that the converse to Theorem 
45.17 is false. That is, a UFD need not be a PID. 


45.18 Corollary 


Proof 


45.19 Definition 


45.20 Example 
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Many algebra texts start by proving the following corollary of Theorem 45.17. We 
have assumed that you were familiar with this corollary and used it freely in our other 
work. 


(Fundamental Theorem of Arithmetic) The integral domain Z is a UFD. 


We have seen that all ideals in Z are of the form nZ = (n) for n € Z. Thus Z is a PID. 
and Theorem 45.17 applies. Sd 


It is worth noting that the proof that Z is a PID was really way back in Corollary 6.7. 
We proved Theorem 6.6 by using the division algorithm for Z exactly as we proved, in 
Theorem 27.24, that F [x] is a PID by using the division algorithm for F [x]. In Section 46, 
we shall examine this parallel more closely. 


If D Is a VED, then D[x] Isa UFD 


We now start the proof of Theorem 45.29, our second main result for this section. The 
idea of the argument is as follows. Let D be a UFD. We can form a field of quotients F 
of D. Then F[x] is a UFD by Theorem 23.20, and we shall show that we can recover 
a factorization for f(x) € D[x] from its factorization in F[x]. It will be necessary to 
compare the irreducibles in F[x] with those in D[x], of course. This approach, which 
we prefer as more intuitive than some more efficient modern ones, is essentially due to 
Gauss. 


Let) be a UFD and let a), a2, -++,d, be nonzero elements of D. An element d of D is 
a greatest common divisor (abbreviated gcd) of all of the a; if d|a; fori =1,---,n 
and any other d’ € D that divides all the a; also divides d. | 


In this definition, we called d “a” gcd rather than “the” gcd because gcd’s are only 
defined up to units. Suppose that d and d’ are two gcd’s of a; fori = 1,---,n. Then 
d|d’ and d'|d by our definition. Thus d = q’d’ and d’ = qd for some g, q' € D, so 
ld = q'qd. By cancellation in D, we see that g’q = 1 so q and q’ are indeed units. 

The technique in the example that follows shows that gcd’s exist in a UFD. 


Let us find a gcd of 420, —168, and 252 in the UFD Z. Factoring, we obtain 420 = pre 
3-5-7, -168 = 23 - (—3)- 7, and 252 = 2? -3?- 7. We choose one of these numbers, 
say 420, and find the highest power of each of its irreducible factors (up to associates) 
that divides all the numbers, 420, —168 and 252 in our case. We take as gcd the product 
of these highest powers of irreducibles. For our example, these powers of irreducible 
factors of 420 are 27, 3', 5°, and 7! so we take as gcd d = 4-3-1-7 = 84. The only 
other ged of these numbers in Z is —84, because 1 and —1 are the only units. A 


Execution of the technique in Example 45.20 depends on being able to factor an 
element of a UFD into a product of irreducibles. This can be a tough job, even in Z. 
Section 46 will exhibit a technique, the Euclidean Algorithm, that will allow us to find 
gcd’s without factoring in a class of UFD’s that includes Z and F[x] fora field F. 
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45.21 Definition 


45.22 Example 


45.23 Lemma 


Proof 


45.24 Example 


45.25 Lemma 


Proof 


Factorization 


Let D be a UFD. A nonconstant polynomial 
f(x) = ay t+ayx +---+anx” 


in D[x] is primitive if 1 is a gcd of the a; fori =0,1,---,n. | 


In Z[x], 4x? + 3x + 2 is primitive, but 4x? + 6x + 2 is not, since 2, a nonunit in Z, is a 
common divisor of 4, 6, and 2. A 


Observe that every nonconstant irreducible in D[x] must be a primitive polynomial. 


If D is a UFD, then for every nonconstant f(x) € D[x] we have f(x) = (c)g(x), where 
c € D, g(x) € D{x], and g(x) is primitive. The element c is unique up to a unit factor 
in D and is the content of f(x). Also g(x) is unique up to a unit factor in D. 


Let f(x) € D[x] be given where f(x) is a nonconstant polynomial with coefficients 
do, @,+*+, @,. Let c be a gcd of the a; for i = 0,1,---,. Then for each i, we have 
a; = cq; for some g; € D. By the distributive law, we have f(x) = (c)g(x), where no 
irreducible in D divides all of the coefficients go, g1,---.qn of g(x). Thus g(x) is a 
primitive polynomial. 

For uniqueness, if also f(x) = (DAC) ford € D, h(x) € D{[x], and A(x) primitive, 
then each irreducible factor of c must divide d and conversely. By setting (c)g(x) = 
(d)h(x) and canceling irreducible factors of c into d, we arrive at (u)g(x) = (v)h(x) for 
a unit wu € D. But then vy must be a unit of D or we would be able to cancel irreducible 
factors of v into u. Thus u and v are both units, so c is unique up to a unit factor. From 
f(x) = (c)g(x), we see that the primitive polynomial g(x) is also unique up to a unit 
factor. ¢ 


In Z[x], 
4x? + 6x — 8 = (2)(2x? + 3x — 4), 

where 2x? + 3x — 4 is primitive. A 
(Gauss’s Lemma) If Dis a UFD, then a product of two primitive polynomials in D[x] 
is again primitive. 
Let 

f&) = ao + ax +++ + yx" 
and 

g(x) = bo + Dix +++ + By x™ 


be primitive in D[x], and let h@) = f(x)g(x). Let p be an irreducible in D. Then p 
does not divide all a; and p does not divide all b;, since f(x) and g(x) are primitive. Let 
ay be the first coefficient of f(x) not divisible by p; that is, p|a; fori <r, but p{a, 
(that is, p does not divide a,). Similarly, let p |b; for j < s, but p{b,. The coefficient 
of x5 in h(x) = f()g(x) is 


Cros = (Aobpas +--+ + Gy 15.41) + a;b, + (Gy41D5—1 +--+ + dp4sbq). 


45.26 Corollary 


Proof 


45.27 Lemma 


Proof 
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Now p|a; fort <r implies that 

P| Gobrts +++» + a—-1bs41), 
and also p |b; for 7 < s implies that 

P| Gr410s-1 +--+ + 445b0). 


But p does not divide a, or b,, so p does not divide a,b,, and consequently p does not 
divide c,,;. This shows that given an irreducible p € D, there is some coefficient of 
f (x)g(x) not divisible by p. Thus f(x)g(x) is primitive. ¢ 


If D is a UFD, then a finite product of primitive polynomials in D[x] is again primitive. 
This corollary follows from Lemma 45.25 by induction. ¢ 


Now let D be a UFD and let F be a field of quotients of D. By Theorem 23.20, 
F[x] is a UFD. As we said earlier, we shall show that D[x] is a UFD by carrying a 
factorization in F[x] of f(x) € D[x] back into one in D[x]. The next lemma relates the 
nonconstant irreducibles of D[x] to those of F [x]. This is the last important step. 


Let D be a UFD and let F be a field of quotients of D. Let f(x) € D[x], where (degree 
f(x)) > 0. If f(x) is an irreducible in D[x], then f(x) is also an irreducible in F [x]. 
Also, if f(x) is primitive in D[x] and irreducible in F[x], then f(x) is irreducible in 
D[x]. 


Suppose that a nonconstant f(x) € D[x] factors into polynomials of lower degree in 
F [x], that is, 


J) = r(x)s(x) 


for r(x), s(x) € F[x]. Then since F is a field of quotients of D, each coefficient in 
r(x) and s(x) is of the form a/b for some a, b € D. By clearing denominators, we can 
get 


(A) f(x) = r1(x)s1 (x) 


ford € D, andr,(x), s1(x) € D[x], where the degrees of r; (x) and s,(x) are the degrees 
of r(x) and s(x), respectively. By Lemma 45.23, f(x) = (c)g(x), r(x) = (c))r2(x), and 
51 (%) = (€2)s2{x) for primitive polynomials g(x), r2(x), and s2(x), and c, cy, € D. 
Then 


(dc)g(x) = (c1e2)r2(x) 52x), 


and by Lemma 45.25, r2(x)s2(x) is primitive. By the uniqueness part of Lemma 45.23, 
c1C2 = dcu for some unit u in D. But then 


(dc)g(x) = (deu)ro(x)s2(x), 


398 


Part IX 


45.28 Corollary 


Proof 


45.29 Theorem 
Proof 


Factorization 


sO 
f(x) = (e)g(x) = (cu)re(x)s2(x). 


We have shown that if f (x) factors nontrivially in F [x], then f(x) factors nontrivially 
into polynomials of the same degrees in D[x]. Thus if f(x) € D[x] is irreducible in 
D{x], it must be irreducible in F [x]. 

A nonconstant f(x) € D[x] that is primitive in D{x] and irreducible in F'[x] is also 
irreducible in D[x], since D[x] C F[x]. Sd 


Lemma 45.27 shows that if D is a UFD, the irreducibles in D[x] are precisely 
the irreducibles in D, together with the nonconstant primitive polynomials that are 
irreducible in F [x], where F is a field of quotients of D[x]. 

The preceding lemma is very important in its own right. This is indicated by the 
following corollary, a special case of which was our Theorem 23.11. (We admit that it 
does not seem very sensible to call a special case of a corollary of a lemma a theorem. 
The label assigned to a result depends somewhat on the context in which it appears.) 


If D is a UFD and F is a field of quotients of D, then a nonconstant f(x) € D[x] factors 
into a product of two polynomials of lower degrees r and s in F[x] if and only if it has 
a factorization into polynomials of the same degrees r and s in D[x]. 


It was shown in the proof of Lemma 45.27 that if f(x) factors into a product of two 
polynomials of lower degree in F[x], then it has a factorization into polynomials of the 
same degrees in D[x] (see the next to last sentence of the first paragraph of the proof). 

The converse holds since D[x] C F [x]. Sd 


We are now prepared to prove our main theorem. 


If D is a UFD, then D[x] is a UFD. 


Let f(x) € D[x], where f(x) is neither 0 nor a unit. If f(x) is of degree 0, we are done, 
since D is a UFD. Suppose that (degree f(x)) > 0. Let 


SH) = 81%) 82(%) +++ B(x) 


be a factorization of f(x) in D[x] having the greatest number r of factors of positive 
degree. (There is such a greatest number of such factors because r cannot exceed the 
degree of f (x).) Now factor each g;(x) in the form g;(x) = c;h;(x) where c; is the content 
of g;(x) and h;(x) is a primitive polynomial. Each of the h;(x) is irreducible, because 
if it could be factored, none of the factors could lie in D, hence all would have positive 
degree leading to a corresponding factorzation of g;(x), and then to a factorization of 
Jf (&) with more than r factors of positive degree, contradicting our choice of r. Thus we 
now have 


S(&) = erhi()erho(x) -- + ch, (x) 


where the /;(x) are irreducible in D[x]. If we now factor the c; into irreducibles in D, 
we obtain a factorization of f(x) into a product of itreducibles in D[x]. 

The factorization of f(x) € D[x], where f(x) has degree 0, is unique since D isa 
UFD; see the comment following Lemma 45.27. If f(x) has degree greater than 0, we 
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can view any factorization of f(x) into irreducibles in D[x] as a factorization in F[x] 
into units (that is, the factors in D) and irreducible polynomials in F [x] by Lemma 45.27. 
By Theorem 23.20, these polynomials are unique, except for possible constant factors 
in F. But as an irreducible in D[x], each polynomial of degree >0 appearing in the 
factorization of f(x) in D[x] is primitive. By the uniqueness part of Lemma 45.23, this 
shows that these polynomials are unique in D[x] up to unit factors, that is, associates. 
The product of the irreducibles in D in the factorization of f(x) is the content of f(x). 
which is again unique up to a unit factor by Lemma 45.23. Thus all irreducibles in D[x] 
appearing in the factorization are unique up to order and associates. Sd 


45.30 Corollary If F isa field and x, ---, x, are indeterminates, then F[x1, +--+, X,] is a UFD. 


Proof By Theorem 23.20, F[x)] is a UFD. By Theorem 45.29, so is (F [x] 2] = F'[1, x2]. 
Continuing in this procedure, we see (by induction) that F[x;, ---, %,] is a UFD. Sd 


We have seen that a PID is a UFD. Corollary 45.30 makes it casy for us to give an 
example that shows that not every UFD is a PID. 


45.31 Example Let F be a field and let x and y be indeterminates. Then F[x, y] is a UFD by Corollary 
45.30. Consider the set NV of all polynomials in x and y in F[x, y] having constant term 0. 
Then N is an ideal, but not a principal ideal. Thus F[x, y] is not a PID. A 


Another example of a UFD that is not a PID is Z[x], as shown in Exercise 12, 
Section 46. 


@ EXERCISES 45 


Computations 


In Exercises 1 through 8, determine whether the element is an irreducible of the indicated domain. 


1.5inZ 2. -17inZ 

3. 14inZ 4. 2x —3in Z[x] 

5. 2x — 10 in Z[x] 6. 2x — 3 in Qfx] 

7. 2x — 10 in Qfx] 8. 2x — 10in Zy[x] 

9, If possible, give four different associates of 2x — 7 viewed as an element of Z[x]; of Q[x]; of Z,;[x]. 


10. Factor the polynomial 4x? — 4x + 8 into a product of irreducibles viewing it as an element of the integral 
domain Z[x]; of the integral domain Q[x]; of the integral domain Z,, [x]. 


In Exercises 11 through 13, find all ged’s of the given elements of Z. 
11. 234, 3250, 1690 12. 784, —1960, 448 13. 2178, 396, 792, 594 


In Exercises 14 through 17, express the given polynomial as the product of its content with a primitive polynomial 
in the indicated UFD. 


14. 18x? — 12x + 48 in Z[x] 15. 18x? — 12x + 48 in Q[x] 
16. 2x? — 3x + 6in Z[x] 17. 2x? — 3x +6in Z[x] 
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Concepts 


In Exercises 18 through 20, correct the definition of the italicized term without reference to the text, if correction 
is needed, so that it is in a form acceptable for publication. 


18. 


19. 


20. 


21. 


22. 


23. 


24. 


Two elements a and b in an integral domain D are associates in D if and only if their quotient a/b in Disa 
unit. 


An element of an integral domain D is an irreducible of D if and only if it cannot be factored into a product 
of two elements of D. 

An element of an integral domain D is a prime of D if and only if it cannot be factored into a product of two 
smaller elements of D. 

Mark each of the following true or false. 

a. Every field is a UFD. 

b. Every field is a PID. 

c. Every PID is a UFD. 

d. Every UFD is a PID. 

e. Z[x] is a UFD. 

f. Any two irreducibles in any UFD are associates. 

g. 

h. 


If D is a PID, then D[x] is a PID. 
If D is a UFD, then D[x] is a UFD. 
i. In any UFD, if p| a for an irreducible p, then p itself appears in every factorization of a. 
j. A UFD has no divisors of 0. 
Let D be a UFD. Describe the irreducibles in D[x] in terms of the irreducibles in D and the irreducibles in 
F[x], where F is a field of quotients of D. 


Lemma 45,26 states that if D is a UFD with a field of quotients F, then a nonconstant irreducible f(x) of D[x] 
is also an irreducible of F[x]. Show by an example that a g(x) € D[x] that is an irreducible of F[x] need not 
be an irreducible of D[x]. 


All our work in this section was restricted to integral domains. Taking the same definition in this section but for 
a commutative ring with unity, consider factorizations into irreducibles in Z x Z. What can happen? Consider 
in particular (1, 0). 


Theory 


25. 
26. 
27, 


28. 


29. 


30. 
31. 


Prove that if p is a prime in an integral domain D, then p is an irreducible. 
Prove that if p is an irreducible in a UFD, then p is a prime. 


For a commutative ring R with unity show that the relation a ~ b if a is an associate of b (that is, if a = bu 
for wu aunitin R is an equivalence relation on R. 


Let D be an integral domain. Excrcise 37, Section 18 showed that (U, -) is a group where U is the set of units 
of D. Show that the set D* — U of nonunits of D excluding 0 is closed under multiplication. Is this set a group 
under the multiplication of D? 


Let D be a UFD. Show that a nonconstant divisor of a primitive polynomial in D[x] is again a primitive 
polynomial. 


Show that in a PID, every ideal is contained in a maximal ideal. [Hint: Use Lemma 45.10.] 


Factor x? — y? into irreducibles in Q[x, y] and prove that each of the factors is irreducible. 


There are several other concepts often considered that are similar in character to the ascending chain condition on 
ideals in a ring. The following three exercises concern some of these concepts. 
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32. Let R be any ring. The ascending chain condition (ACC) for ideals holds in R if every strictly increasing 
sequence N; C No C N3 C --- of ideals in R is of finite length. The maximum condition (MC) for ideals 
holds in R if every nonempty set § of ideals in R contains an ideal not properly contained in any other ideal of 
the set S. The finite basis condition (FBC) for ideals holds in R if for each ideal N in R, there is a finite set 
By = {b,,---,b,} © N such that N is the intersection of all ideals of R containing By. The set By is a finite 
generating set for V. 

Show that for every ring R, the conditions ACC, MC, and FBC are equivalent. 

33. Let R be any ring. The descending chain condition (DCC) for ideals holds in R if every strictly decreasing 
sequence N, D N2 D N32 --- of ideals in R is of finite length. The minimum condition (mC) for ideals 
holds in R if given any set S of ideals of R, there is an ideal of 5 that does not properly contain any other ideal 
in the set S. 

Show that for every ring, the conditions DCC and mC are equivalent. 


34. Give an example of a ring in which ACC holds but DCC does not hold. (See Exercises 32 and 33.) 


EUCLIDEAN DOMAINS 


We have remarked several times on the importance of division algorithms. Our first 
contact with them was the division algorithm for Z in Section 6. This algorithm was 
immediately used to prove the important theorem that a subgroup of a cyclic group is 
cyclic, that is, has a single generator. Of course, this shows at once that Z is a PID. The 
division algorithm for F(x] appeared in Theorem 23.1 and was used in a completely 
analogous way to show that F[x] is a PID. Now a modern technique of mathematics is to 
take some clearly related situations and to try to bring them under one roof by abstracting 
the important ideas common to them. The following definition is an illustration of this 
technique, as is this whole text! Let us see what we can develop by starting with the 
existence of a fairly general division algorithm in an integral domain. 


46.1 Definition A Euclidean norm on an integral domain D is a function v mapping the nonzero elements 
of D into the nonnegative integers such that the following conditions are satisfied: 


1. Foralla,b € D with b £ O, there exist g andr in D such thata = bq +r, 
where either r = 0 or v(r) < V(b). 


2. For all a,b € D, where neither a nor b is 0, v(a) < v(ab). 


An integral domain D is a Euclidean domain if there exists a Euclidean norm on D. 
| 


The importance of Condition 1 is clear from our discussion. The importance of 
Condition 2 is that it will enable us to characterize the units of a Euclidean domain D. 


46.2 Example The integral domain Z is a Euclidean domain, for the function v defined by v(n) = |n| 
for n # 0 in Z is a Euclidean norm on Z. Condition 1 holds by the division algorithm 
for Z. Condition 2 follows from |ab| = |a||b{ and |a| > | fora 4 0 in Z. A 


46.3 Example If F is a field, then F [x] is a Euclidean domain, for the function v defined by v( f(x) = 
(degree f(x)) for f(x) € F[x], and f(x) ¥ 0is a Euclidean norm. Condition 1 holds by 
Theorem 23.1, and Condition 2 holds since the degree of the product of two polynomials 
is the sum of their degrees. A 
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46.4 Theorem 
Proof 


46.5 Corollary 
Proof 


46.6 Theorem 


Proof 


Factorization 


Of course, we should give some examples of Euclidean domains other than these 
familiar ones that motivated the definition. We shall do this in Section 47. In view of the 
opening remarks, we anticipate the following theorem. 


Every Euclidean domain is a PID. 


Let D be a Euclidean domain with a Euclidean norm v, and let N be an ideal in D. 
If N = {0}, then N = (0) and N is principal. Suppose that N 4 {0}. Then there exists 
b £0 in N. Let us choose b such that v(b) is minimal among all v(m) for n ¢ N. We 
claim that N = (b). Leta € N. Then by Condition 1 for a Euclidean domain, there exist 
q and r in D such that 


a=bq-+r, 


where eitherr = Oorv(r) < v(b). Nowr = a — bg anda, b € N, sothatr € N since N 
is an ideal. Thus v(r) < v(b) is impossible by our choice of b. Hence r = 0, soa = bg. 


Since a was any element of N, we see that N = (b). 5 
A Euclidean domain is a UFD. 

By Theorem 46.4, a Euclidean domain is a PID and by Theorem 45.17, a PID is a 
UFD. Sd 


Finally, we should mention that while a Euclidean domain is a PID by Theorem 46.4, 
not every PID is a Euclidean domain. Examples of PIDs that are not Euclidean are not 
easily found, however. 


Arithmetic in Euclidean Domains 


We shall now investigate some properties of Euclidean domains related to their multi- 
plicative structure. We emphasize that the arithmetic structure of a Euclidean domain 
is not affected in any way by a Euclidean norm v on the domain. A Euclidean norm is 
merely a useful tool for possibly throwing some light on this arithmetic structure of the 
domain. The arithmetic structure of a domain D is completely determined by the set D 
and the two binary operations + and - on D. 

Let D be a Euclidean domain with a Euclidean norm v. We can use Condition 2 of 
a Euclidean norm to characterize the units of D. 


For a Euclidean domain with a Euclidean norm v, v(1) is minimal among all v(a) for 
nonzero a € D, and u € Disa unit if and only if v(v) = v(1). 
Condition 2 for v tells us at once that for a 4 0, 
v1) < va) = v(a). 
On the other hand, if u is a unit in D, then 
v(u) < v(uu!) = v(1). 
Thus 
v(u) = v1) 
for a unit u in D. 
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Conversely, suppose that a nonzero u € D is such that v(u) = v(1). Then by the 
division algorithm, there exist g andr in D such that 


l=ug-tr, 


where either r = 0 or v(r) < vz). But since v(u) = v(1) is minimal over all v(d) for 
nonzero d € D, v(r) < v(u)is impossible. Hencer = Oandl =ug,souisaunit. 


46.7 Example For Z with v(m) = |n|, the minimum of v(n) for nonzero n € Z@ is 1, and 1 and ~1 
are the only elements of Z with v(n) = 1. Of course, 1 and —1 are exactly the units 
of Z. A 
46.8 Example For F[x] with v(f(x)) = (degree f(x)) for f(x) 4 0, the minimum value of v(f(x)) 


for all nonzero f(x) € F[x] is 0. The nonzero polynomials of degree O are exactly the 
nonzero elements of F’, and these are precisely the units of F[x]. A 


We emphasize that everything we prove here holds in every Euclidean domain, in 
particular in Z and F [x]. As indicated in Example 45.20, we can show that any a and b 
ina UFD have a gcd and actually compute one by factoring a and b into irreducibles, but 
such factorizations can be very tough to find. However, if a UFD is actually Euclidean, 
and we know an easily computed Euclidean norm, there is an easy constructive way to 


find gcd’s, as the next theorem shows. 


@ HisroricaL NOTE 


he Euclidean algorithm appears in Euclid’s 

Elements as propositions 1 and 2 of Book VII, 
where it is used as here to find the greatest common 
divisor of two integers. Euclid uses it again in Book 
X (propositions 2 and 3) to find the greatest com- 
mon measure of two magnitudes (if it exists) and to 
determine whether two magnitudes are incommen- 
surable. 

The algorithm appears again in the Brahme- 
sphutasiddhanta (Correct Astronomical System 
of Brahma) (628) of the seventh-century Indian 
mathematician and astronomer Brahmagupta. To 
solve the indeterminate equation rx +c =sy in 
integers, Brahmagupta uses Euclid’s procedure to 
“reciprocally divide” r by s until he reaches the final 
nonzero remainder. By then using, in effect, a sub- 
stitution procedure based on the various quotients 
and remainders, he produces a straightforward al- 
gorithm for finding the smallest positive solution to 
his equation. 


The thirteenth-century Chinese algebraist Qin 
Jiushao also used the Euclidean algorithm in his 
solution of the so-called Chinese Remainder prob- 
lem published in the Shushu jiuzhang (Mathemat- 
ical Treatise in Nine Sections) (1247). Qin’s goal 
was to display a method for solving the system 
of congruences N =r; (mod m,). As part of that 
method he needed to solve congruences of the form 
Nx =1 (mod m), where N and m are relatively 
prime. The solution to a congruence of this form 
is again found by a substitution procedure, differ- 
ent from the Indian one, using the quotients and 
remainders from the Euclidean algorithm applied 
to N and m. It is not known whether the common 
element in the Indian and Chinese algorithms, the 
Euclidean algorithm itself, was discovered indepen- 
dently in these cultures or was learned from Greek 
sources. 


404 


Part IX 


46.9 Theorem 


Proof 


Factorization 


(Euclidean Algorithm) Let D be a Euclidean domain with a Euclidean norm v, and 
let a and b be nonzero elements of D. Let r; be as in Condition 1 for a Euclidean norm, 
that is, 

oe Seo a=bqa+n, 


where either r; = O or v(ry) < vb) r, #0, let r2 be such that 
b=riga tra, 
where either r2 = 0 or v(r2) < v(r,). In general, let r;4; be such that 
h-1 Sng + risis 


where either rj; = 0 or v(7;11) < v(r;). Then the sequence r;, r2, -+- must terminate 
with some r, = 0. Ifr; = 0, then bis a gcd of a and b. Ifr, 4 0 ands, is the first; = 0, 
then a gcd of a and D is r,_}. 

Furthermore, if d is a gcd of a and b, then there exist A and yw in D such that 
d=ha+ pb. 


Since v(r;) < v(r;_-1) and v(x) is a nonnegative integer, it follows that after some finite 
number of steps we must arrive at some r, = 0. 

Ifr; = 0, then a = bg, and b is a gcd of a and b. Suppose r; # 0. Then if d | a and 
d |b, we have 


d|(a— bq), 
sad [r,. However, if dj |r, and dj; | b, then 
dy |(bgqi +11). 


so d, | a. Thus the set of common divisors of a and b is the same set as the set of common 
divisors of b and r;. By a similar argument, if r. 4 0, the set of common divisors of b 
and r, is the same set as the set of common divisors of 7; and r2. Continuing this process, 
we see finally that the set of common divisors of a and b is the same set as the set of 
common divisors of r,_2 and r,;_,, where r, is the first r; equal to 0. Thus a gcd of r,_2 
and r;_; is also a gcd of a and b. But the equation 


Vs—2 = Qsls—1 t+r= Ysls-1 


shows that a ged of r,;_2 and r;_; is rs_y. 

Tt remains to show that we can express a gcd d of a and b as d=dAa+ yb. In 
terms of the construction just given, if d = b, then d = Oa + 1b and we are done. If 
d = rs_;, then, working backward through our equations, we can express each r; in the 
form A;7;-1 + Miti-2 for some A;, “4; € D. To illustrate using the first step, from the 
equation 


P53 = qs—1l's—2 + Ps-1 


we obtain 


d=frs_1 =Ps 3 — Gs—19% s_2. (1) 


46.10 Example 


46.11 Example 
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We then express r;_2 in terms of r,;_3 and r,_4 and substitute in Eq. (1) to express d in 
terms of Ts-3 and r;_4. Eventually, we will have 


d > Aotot wars = A3(b — riqz) + ary = A3b + (3 — Asqairi 
= Agb + (U3 — Asqz)(a — bq) 


which can be expressed in the form d = Aa + yb. If d’ is any other gcd of a and b, then 
d' = ud for some unit u, so d’ = (Au)a + (uu)b. ¢ 


The nice thing about Theorem 46.9 is that it can be implemented on a computer. Of 
course, We anticipate that of anything that is labeled an “algorithm.” 


Let us illustrate the Euclidean algorithm for the Euclidean norm | | on Z by computing a 
ged of 22,471 and 3,266. We just apply the division algorithm over and over again, and 
the last nonzero remainder is a gcd. We label the numbers obtained as in Theorem 46.9 
to further illustrate the statement and proof of the theorem. The computations are easily 
checked. 


a = 22,471 
b = 3,266 
22,471 = (3,266)6 +2.875 ry = 2,875 
3,266 = (2,875)1 +391 ry = 391 
2,875 = (391)7 + 138 r3 = 138 
391 = (138)2 + 115 rg = 115 
138 = (115)1 +23 703 
115 = (23)5 +0 76 = 0 


Thus r5 = 23 is a ged of 22,471 and 3,266. We found a gcd without factoring! This 
is important, for sometimes it is very difficult to find a factorization of an integer into 
primes. A 


Note that the division algorithm Condition 1 in the definition of a Euclidean norm says 
nothing about r being “positive.” In computing a ged in Z by the Euclidean algorithm 
for | |, as in Example 46.10, it is surely to our interest to make |r;| as small as possible 
in each division. Thus, repeating Example 46.10, it would be more efficient to write 


a = 22,47] 
b = 3,266 
22,471 = (3,266)7 — 391 ry = —391 
3,266 = (391)8 + 138 r2 = 138 
391 = (138)3 — 23 r3 = —23 
138 = (23)6 + 0 rg = 0 


We can change the sign of 7; from negative to positive when we wish since the divisors 
of r; and —r; are the same. A 
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EXERCISES 46 


Computations 


In Exercises 1 through 5, state whether the given function v is a Euclidean norm for the given integral domain. 


1. The function v for Z given by v(m) = n? for nonzeron € Z 


2. 


3 


The function v for Z[x] given by v( f(x)) = (degree of f(x)) for f(x) € Z[x], f(x) #0 


. The function v for Z[x] given by v(f(x)) = (the absolute value of the coefficient of the highest degree nonzero 


term of f(x)) for nonzero f(x) € Z[x] 


4, The function v for Q given by v(a) = a’ for nonzero a € Q 


5. The function v for Q given by v(a) = 50 for nonzero a € Q 
6. By referring to Example 46.11, actually express the ged 23 in the form 2.(22,471) + (3,266) for 2, 4 € Z. 


{Hint: From the next to the last line of the computation in Example 46.11, 23 = (138)3 — 391. From the line 
before that, 138 = 3,266 — (391)8, so substituting, you get 23 = [3,266 — (391)8]3 — 391, and so on. That is, 
work your way back up to actually find values for 4 and y.] 


7, Find a gcd of 49,349 and 15,555 in Z. 


8. Following the idea of Exercise 6 and referring to Exercise 7, express the positive gcd of 49,349 and 15,555 in 
Z in the form A(49,349) + (15,555) for A, w € Z. 
9, Find a ged of 
x! — 3x9 4 3x8 — 11x? + 11x® — 1x5 + 19x4 — 13x? + 8x7 - 9x +3 
and 
x® — 3x° 4 3x4 — 9x? 4 5x? —5x +2 
in Q{x]. 
10. Describe how the Euclidean Algorithm can be used to find the ged of n members a), 42, +++, dn of a Euclidean 
domain. 
11. Using your method devised in Exercise 10, find the ged of 2178, 396, 792, and 726. 
Concepts 
12. Let us consider Z[x]. 
a. Is Z[x] a UFD? Why? 
b. Show that {a + xf(x)|a €2Z, f(x) € Z[x]} is an ideal in Z[x]. 
c. Is Z[x] a PID? (Consider part (b).) 
d. Is Z[x] a Euclidean domain? Why? 
13. Mark each of the following true or false. 


. Every Euclidean domain is a PID. 

. Every PID is a Euclidean domain. 

. Every Euclidean domain is a UFD. 

Every UFD is a Euclidean domain. 

. A ged of 2 and 3 in Qis 5. 

. The Buclidean algorithm gives a constructive method for finding a ged of two integers. 

. If v is a Euclidean norm on a Euclidean domain D, then v(1) < v(a) for all nonzero a € D. 


rm oa oe Sf 


14. 
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h. If vis a Euclidean norm on a Euclidean domain D, then v(1) < v(a) for allnonzeroa € D,a #1. 


i. If v is a Euclidean norm on a Euclidean domain D, then v(1) < v(a) for all nonzero nonunits 
aeD. 


j. For any field F, F [x] is a Euclidean domain. 


Does the choice of a particular Euclidean norm v on a Euclidean domain D influence the arithmetic structure 
of D in any way? Explain. 


Theory 


15. 


16. 


17. 


18. 
19. 


20. 


21. 


22. 


. 


23. 


24, 


Let D be a Euclidean domain and let v be a Euclidean norm on D. Show that if a and b are associates in D, 

then v(a) = v(b). 

Let D be a Euclidean domain and let v be a Euclidean norm on D. Show that for nonzero a, b € D, one has 

v(a) < v(ab) if and only if } is not a unit of D. [Hint: Argue from Exercise 15 that v(a) < v(ab) implies that 

bis not a unit of D. Using the Euclidean algorithm, show that v(a) = v(ab) implies (a) = (ab). Conclude that 

if b is not a unit, then v(a) < v(ab).] 

Prove or disprove the following statement: If v is a Euclidean norm on Euclidean domain D, then {a <€ 

D|v(a) > v(1)} U {0} is an ideal of D. 

Show that every field is a Euclidean domain. 

Let v be a Euclidean norm on a Euclidean domain D. 

a. Show that ifs € Z such that s + v(1) > 0, then 7 : D* — Z defined by n(a) = v(a) + s for nonzeroa € D 
is a Euclidean norm on D. As usual, D* is the set of nonzero elements of D. 

b. Show that for t € Z~, 4: D* — Z given by A(a) =f - v(a) for nonzero a € D is a Euclidean norm on D. 

c. Show that there exists a Euclidean norm jz on D such that w(1) = 1 and y(a) > 100 for all nonzero nonunits 
aéeD. 


Let D be a UFD. An element c in D is a least common multiple (abbreviated Iem) of two elements a and 
bin Difa|c, b{c andif c divides every element of D that is divisible by both a and b. Show that every two 
nonzero elements a and b of a Euclidean domain D have an lcm in D. [Hint: Show that all common multiples, 
in the obvious sense, of both a and b form an ideal of D.] 


Use the last statement in Theorem 46.9 to show that two nonzero elements r, s € Z generate the group (Z, +) 
if and only if r and s, viewed as integers in the domain Z, are relatively prime, that is, have a gcd of 1. 


Using the last statement in Theorem 46.9, show that for nonzero a, b,n € Z, the congruence ax = b (mod n) 
has a solution in Z if a and n are relatively prime. 


Generalize Exercise 22 by showing that for nonzero a, b,n € Z, the congruence ax = b (modn) hasa solution 
in Z if and only if the positive gcd of a and n in Z divides b. Interpret this result in the ring Z,. 


Following the idea of Exercises 6 and 23, outline a constructive method for finding a solution in Z of the 
congruence ax = b (mod n) for nonzero a, b,n € Z, if the congruence does have a solution. Use this method 
to find a solution of the congruence 22x = 18 (mod 42). 


GAUSSIAN INTEGERS AND MULTIPLICATIVE NORMS 


Gaussian Integers 


We should give an example of a Euclidean domain different from Z and F'[x]. 


47.1 Definition A Gaussian integer is acomplex number a + bi, where a, b € Z. Fora Gaussian integer 


a =a + bi, the norm N(q) of o is a2 + b?. | 
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Factorization 


We shall let Z[i] be the set of all Gaussian integers. The following lemma gives 
some basic properties of the norm function N on Z[i] and leads to a demonstration that 
the function v defined by v(a~) = N(q@) for nonzero a € Z[i] is a Euclidean norm on 
Z{i]. Note that the Gaussian integers include all the rational integers, that is, all the 


elements of Z. 


= HistoricaL NoTE 


LE his Disquisitiones Arithmeticae, Gauss studied 
in detail the theory of quadratic residues, that is, 
the theory of solutions to the congruence x” = p 
(mod q) and proved the famous quadratic reci- 
procity theorem showing the relationship between 
the solutions of the congruences x? = p (mod q) 
and x? =q (mod p) where p and q are primes. 
In attempting to generalize his results to theories 
of quartic residues, however, Gauss realized that it 
was much more natural to consider the Gaussian 
integers rather than the ordinary integers. 

Gauss’s investigations of the Gaussian integers 
are contained in a long paper published in 1832 in 
which he proved various analogies between them 
and the ordinary integers. For example, after noting 
that there are four units (invertible elements) among 


the Gaussian integers, namely 1, —1,i, and —i, and 
defining the norm as in Definition 47.1, he gener- 
alized the notion of a prime integer by defining a 
prime Gaussian integer to be one that cannot be ex- 
pressed as the product of two other integers, neither 
of them units. He was then able to determine which 
Gaussian integers are prime: A Gaussian integer that 
is not real is prime if and only if its norm is a real 
prime, which can only be 2 or of the form 47 + 1. 
The real prime 2 = (1 + 7)(1 —7) and real primes 
congruent to | modulo 4 like 13 = (2 + 31)(2 — 37) 
factor as the product of two Gaussian primes. Real 
primes of the form 4 + 3 like 7 and 11 are still 
prime in the domain of Gaussian integers. See Ex- 
ercise 10. 


In Z[i], the following properties of the norm function N hold for alla, 6 € Z[i]: 


If we let a = a; + ai and B = b, + bai, these results are all straightforward computa- 
tions. We leave the proof of these properties as an exercise (see Exercise 11).  d 


47.2 Lemma 
1. Nw) > 0. 
2. N(a) = Oif and only ifa = 0. 
3. N(ap) = N(@)N(B). 
Proof 
47.3 Lemma Zi] is an integral domain. 
Proof 


Itis obvious that Z[7] is a commutative ring with unity. We show that there are no divisors 
of 0. Leta, 8 € Zi]. Using Lemma 47.2, if a8 = 0 then 
N(@)N(B) = N(aB) = N(O) = 0. 


Thus ef = 0 implies that N(~) = 0 or M(B) = 0. By Lemma 47.2 again, this im- 
plies that either a = 0 or 8 = 0. Thus Z[i] has no divisors of 0, so Z[i] is an integral 
domain. Sd 


Of course, since Z[i] is a subring of C, where C is the field of complex numbers, 
it is really obvious that Z[i] has no 0 divisors, We gave the argument of Lemma 47.3 to 


47.4 Theorem 


Proof 


47,5 Example 
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illustrate the use of the multiplicative property 3 of the norm function N and to avoid 
going outside of Z[7] im our argument. 


The function v given by v(@) = N(q@) for nonzero w € Z[i] is a Euclidean norm on Z[i]. 
Thus Zf{i] is a Euclidean domain. 


Note that for 8 = b, + boi 4 0, N(b, + boi) = by? + bs”, so N(B) = 1. Then for all 
a, 6 £0in Zi], N(w) < N(a@)N(B) = N(a@B). This proves Condition 2 for a Euclidean 
norm in Definition 46.1. 

It remains to prove the division algorithm, Condition 1, for N. Leta, B € Z[i], with 
a@ = a4, + ai and B = b, + bi, where 6B + 0. We must find o and p in Z[i] such that 
a = Bo + p, where either o = 0 or N(p) < N(f) = by}? + by’. Let a/B =r 4 si for 
r,s € Q Let gq; and q2 be integers in Z as close as possible to the rational numbers r and 
S, respectively. Let o = g, + qzi and p = a — Bo. If p = 0, we are done. Otherwise, 
by construction of o, we see that |r — qi] < 5 and |s — g2| < 5. Therefore 


NG -7) NS) = Gist) 
N ey oO ee as aa 
=N(r—-—q)+G w= (5) +(5) =5- 
Thus we obtain 


N(p) = N(@ — fo) = v(a(S = “)) = veew(= 2 o) < Nis. 


so we do indeed have N(p) < N(f) as desired. 5 


We can now apply all our results of Section 46 to Z[Z]. In particular, since N(1) = 1, 
the units of Z[i] are exactly the w = a, + ai with N(@) = a;* +a)? = 1. From the 
fact that a; and a» are integers, it follows that the only possibilities are a, = £1 with 
dz = 0, or ay = 0 with ay = +1. Thus the units of Z[z] are +1 and +i. One can also 
use the Euclidean Algorithm to compute a gcd of two nonzero elements, We leave 
such computations to the exercises. Finally, note that while 5 is an irreducible in Z, 
5 is no longer an irreducible in Z[i], for 5 = (1 + 2i)(1 — 27), and neither 1 + 2i nor 
1 — 2i is a unit. A 


Multiplicative Norms 


Let us point out again that for an integral domain D, the arithmetic concepts of irre- 
ducibles and units are not affected in any way by a norm that may be defined on the 
domain. However, as the preceding section and our work thus far in this section show, a 
suitably defined norm may be of help in determining the arithmetic structure of D. This 
is strikingly illustrated in algebraic number theory, where for a domain of algebraic 
integers we consider many different norms of the domain, each doing its part in helping 
to determine the arithmetic structure of the domain. In a domain of algebraic integers, 
we have essentially one norm for each irreducible (up to associates), and each such norm 
gives information concerning the behavior in the integral domain of the irreducible to 


aaa... E|_YS- .....}©| 
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47.6 Definition 


47.7 Theorem 


Proof 


47.8 Example 


47,9 Example 


Factorization 


which it corresponds. This is an example of the importance of studying properties of 
elements in an algebraic structure by means of mappings associated with them. 

Let us study integral domains that have a multiplicative norm satisfying Properties 
9 and 3 of N on Z[i| given in Lemma 47.2. 


Let D be an integral domain. A multiplicative norm N on D is a function mapping D 
into the integers Z such that the following conditions are satisfied: 


1. N(a) = Oif and only ifa = 0. 

2. N(aB) = N@N(B) for alla, B € D. = 
If D is an integral domain with a multiplicative norm N, then N(1) = 1 and |N(u)| = | 
for every unit u in D. If, furthermore, every a such that | («)| = 1 is a unit in D, then 
an element x in D, with |N (r)| = p for a prime p € Z, is an irreducible of D. 


Let D be an integral domain with a multiplicative norm N. Then 
NG) = N(@)A)) = NON 
shows that N(1) = 1. Also, if uw is a unit in D, then 
1 = Nd) = Nu) = NWN”). 


Since N(u) is an integer, this implies that |N (u)| = 1. 
, Now suppose that the units of D are exactly the elements of norm +1. Let 7 <¢ D 
be such that |N(r)| = p, where p isa prime in Z. Then if x = af, we have 


p= |N(x)l = IN@N(P)I, 


so either |[N(a)| = 1 or |N (B)| = 1. By assumption, this means that either a or B is a 
unit of D. Thus 7 is an irreducible of D. ¢ 


On Zi], the function N defined by N(a + bi) = a2 +P gives a multiplicative norm 
in the sense of our definition. We saw that the function v given by v(@) = N(a) for 
nonzero w € Zi] is a Euclidean norm on Zi], so the units are precisely the elements a 
of Z{i] with N(@) = N (1) = 1. Thus the second part of Theorem 47.7 applies in Zl]. We 
saw in Example 47.5 that 5 is not an irreducible in Z[i], for 5 = A + 2i)(1 — 2i). Since 
NO +2i)=N(Q - 2i) = 12 4 22 = 5and5isa prime in Z, we see from Theorem 47.7 
that 1 + 2i and 1 — 2i are both irreducibles in Z[i]. 

As an application of multiplicative norms, we shall now give another example of an 
integral domain that is nota UED. We saw one example in Example 45.16. The following 
is the standard illustration. 


Let ZL. /—5] = {a + ib/5\a,b € Z}. Asa subset of the complex numbers closed under 
addition, subtraction, and multiplication, and containing 0 and 1, Z[/—5] is an integral 
domain. Define N on Z[/—5] by 


Na+bV-5) =a + 5b. 


47.10 Theorem 


Proof 
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(Here /—5 =iVJ5.) Clearly, N(w) =0 if and only if a =a+b./—5 =0. That 
N(aB) = N(a)N(B) is a straightforward computation that we leave to the exercises 
(see Exercise 12). Let us find all candidates for units in Z[./—5] by finding all ele- 
ments a in Z[L./—5] with N(w) = 1. Ifa =a+b./—5, and N(a) = 1, we must have 
a’ + 5b? = 1 for integers a and b. This is only possible if b = 0 anda = +1. Hence +1 
are the only candidates for units. Since +1 are units, they are then precisely the units in 
ZLV/—5]. 
Now in Z[./—5], we have 21 = (3)(7) and also 


21 = (14+2/—5)(1 — 2/—5). 


If we can show that 3, 7, 1 + 2,/—5, and 1 — 2./—5 are all irreducibles in Z[./—5], we 
will then know that Z[./—5] cannot be a UFD, since neither 3 nor 7 is (1 + 2./—S). 
Suppose that 3 = a#f. Then 


9 = NG) = N@)N(B) 


shows that we must have N(q@) = 1, 3, or 9. If N(aw) = 1, then e@ is a unit. If a = 
a+ b./-—5, then N(a) = a? + 5b’, and for no choice of integers @ and b is N(w) = 3. 
If N(a) = 9, then N(8) = 1, so B is a unit. Thus from 3 = wf, we can conclude that 
either a or # is a unit. Therefore, 3 is an irreducible in Z[./—5]. A similar argument 
shows that 7 is also an irreducible in Z[./—5]. 

If 1+ 2./—5 = y6, we have 

21 = N11 +2V—5) = M(y)N(6). 

so N(v) = 1, 3, 7, or 21. We have seen that there is no element of Z[./—5] of norm 
3 or’7. This either N(yv) = 1, and y is a unit, or N(v) = 21, so N(6) = 1, and 6 isa 
unit. Therefore, 1 + 2./—5 is an irreducible in Zi/—5].A parallel argument shows that 
1 — 2,/—5 is also an irreducible in Z[./—5]. 

In summary, we have shown that 


Z[V—5] = {a t+ ibV5 |a, b € Z} 
is an integral domain but not a UFD. In particular, there are two different factorizations 
21=3-7=(14+2V—5)\(1 — 2V-5) 
of 21 into irreducibles. These irreducibles cannot be primes, for the property of a prime 
enables us to prove uniqueness of factorization (see the proof of Theorem 45.17). & 


We conclude with a classical application, determining which primes p in Z are equal 
to a sum of squares of two integers in Z. For example, 2 = 1° + 17,5 = 17 + 2?, and 
13 = 2? + 3? are sums of squares. Since we have now answered this question for the 
only even prime number, 2, we can restrict ourselves to odd primes. 


(Fermat’s p — a* + b? Theorem) Let p be an odd prime in Z. Then p = a” + D? for 
integers a and b in Zif and only if p = 1 (mod 4). 


First, suppose that p = a? + b*. Now a and b cannot both be even or both be odd since 
p is an odd number. If a = 2r and b = 2s + 1, then a? + b* = 4r? + 4(s* +5) +1, so 
p = | (mod 4). This takes care of one direction for this “if and only if” theorem. 
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For the other direction, we assume that p = | (mod 4). Now the multiplicative group 
of nonzero elements of the finite field Z, is cyclic, and has order p — 1. Since 4 is a 
divisor of p — 1, we see that Z, contains an element n of multiplicative order 4. It follows 
that n? has multiplicative order 2, son? = —1 in Z,. Thus in Z, we have n” = —1 (mod 
p), 80 p divides n? + 1 in Z. 

Viewing p and n> +1 in Z[i], we see that p divides n? +1 =(n+i(n —i). Sup- 
pose that p is irreducible in Z[i]; then p would have to divide n + i orn — i. If p divides 
n+i, thenn +i = p(a + bi) for some a, b € Z. Equating coefficients of i, we obtain 
1 = pb, which is impossible. Similarly, p divides n — i would lead to an impossible 
equation —1 = pb. Thus our assumption that p is irreducible in Z[i] must be false. 

Since p is not irreducible in Z[i], we have p = (a + bi)(c + di) where neither a + 
bi nor c+ di is a unit. Taking norms, we have p? = (a? + b?\(c? + d?) where neither 
a? +b? = 1 nor c? +d? = 1. Consequently, we have p = a* +b’, which completes 
our proof. [Since a’ + b* = (a+ bi\(a — bi), we see that this is the factorization of p, 
that is, c+ di =a -— bi.] e 


Exercise 10 asks you to determine which primes p in Z remain irreducible in Z[i]. 


@ EXERCISES 47 


Computations 


In Exercises | through 4, factor the Gaussian integer into a product of irreducibles in Z[i]. [Hint: Since an irreducible 
factor of a € Z[i] must have norm > 1 and dividing N(q), there are only a finite number of Gaussian integers a + bi 
to consider as possible irreducible factors of a given a. Divide w by each of them in C, and see for which ones the 
quotient is again in Z[i].] 


1. 5 2.7 3. 443i 4. 6-—7i 
5. Show that 6 does not factor uniquely (up to associates) into irreducibles in Z[./—5]. Exhibit two different 
factorizations. 


6. Consider a = 7+ 2i and 68 = 3 — 4i in Zi]. Find o and p in Z{i] such that 
a=fo+p with N(p) < N(f). 
[Hint: Use the construction in the proof of Theorem 47.4. 


7. Use a Euclidean algorithm in Z[/] to find a gcd of 8 + 67 and 5 — 15i in Z[Z]. [Hint: Use the construction in 
the proof of Theorem 47.4.] 


Concepts 
8. Mark each of the following true or false. 
a. Z[t] is a PID. 
b. Z[i] is a Euclidean domain. 
c. Every integer in Z is a Gaussian integer. 
d. Every complex number is a Gaussian integer. 
e. A Euclidean algorithm holds in Z[i]. 


f. A multiplicative norm on an integral domain is sometimes an aid in finding irreducibles of the 
domain. 


16. 


17, 


18. 
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g. If N is a multiplicative norm on an integral domain D, then |N(u)| = 1 for every unit u of D. 

h. If F is a field, then the function N defined by N(f(x)) = (degree of f(x)) is a multiplicative norm 
on F[x]. 

i. If F is a field, then the function defined by N(f(x)) = 2@28* ff) for f(x) 4 0 and N(O) = 0 
is a multiplicative norm on F[x] according to our definition. 

j. Z[/—5] is an integral domain but not a UFD. 


. Let D be an integral domain with a multiplicative norm N such that |V(q@)| = 1 fora ¢ Difand only ifa isa 


unit of D. Let x be such that |N(r)| is minimal among all |N(8)| > 1 for 6 € D. Show that z is an irreducible 
of D. 


. a. Show that 2 is equal to the product of a unit and the square of an irreducible in Z[7]. 


b. Show that an odd prime p in Z is irreducible in Z[i] if and only if p = 3 (mod 4). (Use Theorem 47.10.) 


. Prove Lemma 47.2. 
. Prove that N of Example 47.9 is multiplicative, that is, that N(aB) = N(a)N() for a, B € Z[/—S]. 
. Let D be an integral domain with a multiplicative norm N such that |N(q@)| = 1 fora ¢ Difand only ifa@ isa 


unit of D. Show that every nonzero nonunit of D has a factorization into irreducibles in D. 


. Use a Euclidean algorithm in Z[i] to find a gcd of 16+ 7i and 10 — 5i in Z[i]. [Hint: Use the construction in 


the proof of Theorem 47.4.] 


. Let (a) be a nonzero principal ideal in Z[i]. 


a. Show that Z[?]/(q) is a finite ring. [Hint: Use the division algorithm.] 
b. Show that if w is an irreducible of Z[i], then Z[i]/(z:) is a field. 
c. Referring to part (b), find the order and characteristic of each of the following fields. 
i. Z[i]/ (3) ; ii. Z[i]/(1 +7) tit, Z[Z]/(1 + 27} 
Let n € Z* be square free, that is, not divisible by the square of any prime integer. Let Z[./—n] = {a + 
ib/n|a,b € Z}. 
a. Show that the norm N, defined by N(a@) = a? + nb? fora = a + ib./n, is a multiplicative norm on ZL./—n]. 
b. Show that N(@) = 1 fora € ZL./—n] if and only if a is a unit of ZL/—x]. 
c. Show that every nonzero a € Z[./—n] that is not a unit has a factorization into irreducibles in Z[./—n]. 
[Hint: Use part (b).] 
Repeat Exercise 16 for Z[,/n] = {a + b./n ja, b € Z}, with N defined by N(a) = a? — nb? fora =a+bJn 
in ZL/7]. 
Show by a construction analogous to that given in the proof of Theorem 47.4 that the division algorithm holds 
in the integral domain Z[./—2] for v(~) = N(q@) for nonzero @ in this domain (see Exercise 16). (Thus this 
domain is Euclidean. See Hardy and Wright [29] for a discussion of which domains Z[,/n] and Z[,/—n] are 
Euclidean.) 
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AUTOMORPHISMS OF FIELDS 


The Conjugation Isomorphisms of Algebraic Field Theory 


Let F be a field, and let F be an algebraic closure of F, that is, an algebraic extension 
of F that is algebraically closed. Such a field F exists, by Theorem 31.17. Our selection 
of a particular F is not critical, since, as we shall show in Section 49, any two algebraic 
closures of F are isomorphic under a map leaving F fixed. From now on in our work, 
we shall assume that all algebraic extensions and all elements algebraic over a field F 
under consideration are contained in one fixed algebraic closure F of F. 

Remember that we are engaged in the study of zeros of polynomials. In the ter- 
minology of Section 31, studying zeros of polynomials in F'[x] amounts to studying 
the structure of algebraic extensions of F and of elements algebraic over F. We shall 
show that if EF is an algebraic extension of F with a, B € E, then w and £ have the 
same algebraic properties if and only if irr(a, F) = irr(f, F’). We shall phrase this fact 
in terms of mappings, as we have been doing all along in field theory. We achieve 
this by showing that if irr(a, F) = irr(8, F), then there exists an isomorphism Wy,g 
of F(a) onto F(f) that maps each element of F onto itself and maps @ onto 6. The 
next theorem exhibits this isomorphism wo,g. These isomorphisms will become our 
fundamental tools for the study of algebraic extensions; they supplant the evaluation ho- 
momorphisms ¢, of Theorem 22.4, which make their last contribution in defining these 
isomorphisms. Before stating and proving this theorem, let us introduce some more 
terminology. 


¥ Section 52 is not required for the remainder of the text. 
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Part X 


48.1 Definition 


48.2 Example 


48.3 Theorem 


Proof 


Automorphisms and Galois Theory 


Let EF be an algebraic extension of a field F. Two elements a, 6 € E are conjugate over 
F ifirr(a, F) =irr(B, F), that is, if @ and 6 are zeros of the same irreducible polynomial 
over F’. = 


The concept of conjugate elements just defined conforms with the classic idea of con- 
jugate complex numbers if we understand that by conjugate complex numbers we mean 
numbers that are conjugate over R. If a, b € Rand b ¥$ 0, the conjugate complex num- 
bers a + bi and a — bi are both zeros of x7 — 2ax + a? + b*, which is irreducible in 
R[x]. A 


(The Conjugation Isomorphisms) Let F be a field, and let a and £ be algebraic over 
F with deg(a, F) =n. The map Wy,g : F(@) > F() defined by 


Wa,p(Co + Ci +++ + Cpa” ') = co + C1B +++ + Oni B" 


for c; € F is an isomorphism of F(a) onto F() if and only if a and £ are conjugate 
over F. 


Suppose that Wy : F(a@) —> F(@) as defined in the statement of the theorem is an iso- 
morphism. Letirr(a, F) = ap + ayx +--+ + a,x". Thenag + aja +---+a,0” = 0,so 


Wa,p(do + aa +--+ 4,0") = ay +B +---+a,f" =0. 


By the last assertion in the statement of Theorem 29.13 this implies that ir(8, F) di- 
vides irr(a, F). A similar argument using the isomorphism (%o,3)~' = Wg,4 Shows that 
irr(a, F) divides irr(B, F). Therefore, since both polynomials are monic, irr(@, F) = 
im(6, F), so a and f are conjugate over F. 

Conversely, suppose irr(@, F) = irr(8, F) = p(x). Then the evaluation homomor- 
phisms ¢, : F[x] > F(a) and ¢g : F[x] — F(f) both have the same kernel (p(x)). 
By Theorem 26.17, corresponding to @, : F[x] > F(q), there is a natural isomorphism 
Wa mapping F'[x]/(p(x)) onto ¢,[F[x]] = F(a). Similarly, @ gives rise to an isomor- 
phism wg mapping F'[x]/(p(x)) onto F(B). Let vo,g = ve(Wo)?. These mappings are 
diagrammed in Fig. 48.4 where the dashed lines indicate corresponding elements under 
the mappings. As the composition of two isomorphisms, Wy,g is again an isomorphism 
and maps F(e) onto F(B). For (ey) + cya +--+ +c,-;a"7!) € F(a), we have 


Vo,p(Co + C18 +++ + Cn—-ya” 4) 
= (Wea ')(co tera tees t nya!) 


Fix] 
! 
/ 
x 
be 
¥ y = canonical 
residue class map 
Fi va x F 
. “* Te eee Vs 7 ) 
a ++ (p@)) B 


48.4 Figure 


48.5 Corollary 


Proof 


48.6 Corollary 


Proof 
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= Wpl(co t crx +++ + ex" ') + (p@))) 
Scope Pt ++ enuf". 
Thus wo, is the map defined in the statement of the theorem. ¢ 


The following corollary of Theorem 48.3 is the cornerstone of our proof of the 
important Isomorphism Extension Theorem of Section 49 and of most of the rest of our 
work. 


Let a be algebraic over a field F. Every isomorphism yw mapping F(a) onto a subfield 
of F such that (a) = a for a € F maps a onto a conjugate £ of a over F. Conversely, 
for each conjugate 6 of a over F, there exists exactly one isomorphism wWy,g of F(a) 
onto a subfield of F mapping « onto 8 and mapping each a € F onto itself. 


Let y be an isomorphism of F(@) onto a subfield of F such that y(a) = a fora € F. 
Let inr(a, F) = ap t+ ayx +---+ a,x". Then 
dog taat--.-+a,a” =0, 
Se) 
0 = v(ag + aya +--+ + ance") = ag tala) +++ +anv(a)", 


and 6 = w(a) is a conjugate ofa. 

Conversely, for each conjugate 6 of w over F,, the conjugation isomorphism o,g 
of Theorem 48.3 is an isomorphism with the desired properties. That W,,, is the only 
such isomorphism follows from the fact that an isomorphism of F(a) is completely 
determined by its values on elements of F and its value on &. 5 


As a second corollary of Theorem 48.3, we can prove a familiar result. 
Let f(x) € R[x]. If f(a + bi) = 0 for (a + bi) € C, wherea, b € R, then f(a — bi) = 
0 also. Loosely, complex zeros of polynomials with real coefficients occur in conjugate 
pairs. 
We have seen that C = R(i). Now 

ini, R) = in(-i, R) = x* +1, 


soi and —i are conjugate over R. By Theorem 48.3, the conjugation map ¥;,_; : C > C 
where w;,_;(a + bi) = a — bi is an isomorphism. Thus, if for a; € R, 


flat bi) =ap+as(atbi)+---+a,(a4+ di” =0, 
then 
0= WiC f(a t bi)) = ay tay(a — bi) +--+ + a,(a — bi" 
= f(a—bi), 
that is, f(a — bi) = Oalso. . 
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Part X 


48.7 Example 


48.8 Definition 
48.9 Definition 


48.10 Example 


Automorphisms and Galois Theory 


Consider Q(/2) over ©. The zeros of irr(./2, Q) = x? —2are /2, and —/2, so /2 and 
—./2 are conjugate over Q. According to Theorem 48.3 the map Ji V2* Q/2) > 


Q(V2) defined by 
vy pla + bV2) =a — bV2 


is an isomorphism of Q(./2) onto itself. A 


Automorphisms and Fixed Fields 


As illustrated in the preceding corollary and example, a field may have a nontrivial 
isomorphism onto itself. Such maps will be of utmost importance in the work that follows. 


An isomorphism of a field onto itself is an automorphism of the field. a 


If o is an isomorphism of a field E onto some field, then an element a of E is left 
fixed by o if o(a) =a. A collection S of isomorphisms of FE leaves a subfield F of E 
fixed if each a € F is left fixed by every o € S. If {o} leaves F fixed, then o leaves 
F fixed. a 


Let E = Q(V2, V3). The map o : E > E defined by 
o(a+bV2+ev34+dV6) =a + bV2—cV3 —dV6 


for a, b, c,d € Q is an automorphism of £; it is the conjugation isomorphism WN) 
of E onto itself if we view E as (QU/2))\V/3). We see that o leaves Q(/2) fixed. A 


It is our purpose to study the structure of an algebraic extension FE of a field F by 
studying the automorphisms of £ that leave fixed each element of F’. We shall presently 
show that these automorphisms form a group in a natural way. We can then apply the 
results concerning group structure to get information about the structure of our field 
extension. Thus much of our preceding work is now being brought together. The next 
three theorems are readily proved, but the ideas contained in them form the foundation 
for everything that follows. These theorems are therefore of great importance to us. They 
really amount to observations, rather than theorems; it is the ideas contained in them 
that are important. A big step in mathematics does not always consist of proving a hard 
theorem, but may consist of noticing how certain known mathematics may relate to new 
situations. Here we are bringing group theory into our study of zeros of polynomials. Be 
sure to understand the concepts involved. Unlikely as it may seem, they are the key to 
the solution of our final goal in this text. 


Final Goal (to be more precisely stated later): To show that not all zeros of 
every quintic (degree 5) polynomial f(x) can be expressed in terms of radicals 
starting with elements in the field containing the coefficients of f(x). 
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@ HISTORICAL NOTE 


I was Richard Dedekind who first developed the 
idea of an automorphism of a field, what he called 
a “permutation of the field,” in 1894. The earlier ap- 
plication of group theory to the theory of equations 
had been through groups of permutations of the 
roots of certain polynomials. Dedekind extended 
this idea to mappings of the entire field and proved 
several of the theorems of this section. 

Though Heinrich Weber continued Dedekind’s 
approach to groups acting on fields in his algebra 
text of 1895, this method was not pursued in other 
texts near the turn of the century. It was not until the 
1920s, after Emmy Noether’s abstract approach to 


algebra became influential at Gottingen, that Emil 
Artin (1898-1962) developed this relationship of 
groups and fields in great detail. Artin emphasized 
that the goal of what is now called Galois theory 
should not be to determine solvability conditions 
for algebraic equations, but to explore the relation- 
ship between field extensions and groups of auto- 
morphisms. Artin detailed his approach in a lecture 
given in 1926; his method was first published in 
B. L. Van der Waerden’s Modern Algebra text of 
1930 and later by Artin himself in lecture notes in 
1938 and 1942. In fact, the remainder of this text is 
based on Artin’s development of Galois theory. 
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If (0; |i € 1} is a collection of automorphisms of a field E, the elements of E about 
which {o; |i € 7} gives the least information are those a € E left fixed by every o; for 
i ¢ I, This first of our three theorems contains almost all that can be said about these 
fixed elements of £. 


48.11 Theorem Let {o; |i € J} be a collection of automorphisms of a field E. Then the set E(,,) of all 


a € E left fixed by every 0; fori € I forms a subfield of E. 


Proof If o;(a) = a and o;(b) = b for alli ¢€ [, then 
a(akb)=o(a)to(b)=atb 
and 
o;(ab) = o;(a)o;(b) = ab 
for alli € J. Also, tf b # 0, then 
o;(a/b) = 9;(a)/o;(b) = a/b 
for alli € J. Since the o; are automorphisms, we have 
o;(0) = 0 oj(1I) =1 
for alli € J. Hence 0, 1 € Ej,,; Thus Ejo,; is a subfield of E. + 


and 


48.12 Definition The field Ey,,, of Theorem 48.11 is the fixed field of {o; |i ¢ /}. For a single automor- 


phism o, we shall refer to E,,, as the fixed field of o. i 


48.13 Example Consider the automorphism 5 _ 5 of Q(/2) given in Example 48.7. For a, b € Q, 


we have 
wy yqla + bV2) =a — bv2, 
anda — bV2 =a + by? ifand only if b = 0. Thus the fixed fieldofwyz_jzisQ. 
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Part X 


48.14 Theorem 
Proof 


48.15 Theorem 


Proof 


48.16 Definition 


48.17 Example 


Automorphisms and Galois Theory 


Note that an automorphism of a field E is in particular a one-to-one mapping of 
E onto E, that is, a permutation of E. lf o and t are automorphisms of E, then the 
permutation oT is again an automorphism of E, since, in general, composition of homo- 
morphisms again yields a homomorphism. This is how group theory makes its entrance. 


The set of all automorphisms of a field F is a group under function composition. 


Multiplication of automorphisms of EF is defined by function composition, and is thus 
associative (it is permutation multiplication). The identity permutation: : E — E given 
by c(@) = @ for a € E is an automorphism of E. If o is an automorphism, then the 
permutation o~! is also an automorphism. Thus all automorphisms of E forma subgroup 
of Sz, the group of all permutations of E given by Theorem 8.5. ¢ 


Let E bea field, and let F be asubfield of E. Then the set G(E/ F) of all automorphisms of 
E leaving F fixed forms a subgroup of the group of all automorphisms of FE. Furthermore, 
Fes Eger). 
For o,t € G(E/F) anda ¢€ F, we have 
(ot )a) = o(t(a)) = o(a) =a, 

so ot € G(E/F). Of course, the identity automorphism : is in G(E/F). Also, if 
o(a) =a for a € F, then a=o~(a) so o € G(E/F) implies that 07! € G(E/F). 
Thus G(E'/F) is a subgroup of the group of all automorphisms of F. 

Since every element of F is left fixed by every element of G(E/F), it follows 


immediately that the field Egz/) of all elements of FE left fixed by G(E/F) con- 
tains F. . 4 


The group G(E/F) of the preceding theorem is the group of automorphisms of E 
leaving F fixed, or, more briefly, the group of E over F. 


Do not think of E/F in the notation G(E/F) as denoting a quotient space of some 
sort, but rather as meaning that E is an extension field of the field F. 

The ideas contained in the preceding three theorems are illustrated in the following 
example. We urge you to study this example carefully. a 
Consider the field Q(/2, /3). Example 31.9 shows that [Q(/2, V3) : Q] = 4. If we 
view O(/2, V3) as (Q(/3))(V2), the conjugation isomorphism yz _ yg of Theo- 
rem 48.3 defined by 

Wyss + bV2) = a — bV2 
for a,b € QUJ3) is an automorphism of Q./2, V3) having Q(V3) as fixed field. 
Similarly, we have the automorphism yf 3,3 OF OVv2, 73) having Q(./2) as fixed 
field. Since the product of two automorphisms is an automorphism, we can consider 
W3,_y2¥ 3, v3 Which moves both /2 and V3, that is, leaves neither number fixed. Let 


i = the identity automorphism, 
1= Vy ys 
027 = Wy3,_y3, and 
B= Ve pai A 


48.18 Table 


48.19 Theorem 


Proof 
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The group of all automorphisms of Q(./2, ./3) has a fixed field, by Theorem 48.11. 
This fixed field must contain Q, since every automorphism of a field leaves 1 and hence 
the prime subfield fixed. A basis for Q(./2, /3) over Q is {1, V2, V3, 6}. Since 
o(/2) = —V2, 01 (/6) = —V6 and on(s/3) = —/3, we see that Q is exactly the fixed 
field of {1, 01, 02, 03}. It is readily checked that G = {1, 01, 02, 03} is a group under au- 
tomorphism multiplication (function composition). The group table for G is given in 
Table 48.18. For example, 


P63 Vp cpl Vas) — Vays — OF 


The group G is isomorphic to the Klein 4-group. We can show that G is the full 
group G(Q(V2, V3)/Q), because every automorphism 7 of Q/2, /3) maps V2 onto 
either +./2, by Corollary 48.5. Similarly, t maps /3 onto either +./3. But since 
(1, /2, /3, /2/3} is a basis for Q(/2, V3) over Q, an automorphism of OV/2, V3) 
leaving Q fixed is determined by its values on /2 and ./3. Now, t, 01, 62, and a3 give all 
possible combinations of values on /2 and 4/3, and hence are all possible automorphisms 
of Q(V2, V3). 

Note that G(Q(/2, V3)/Q) has order 4, and [Q(/2, /3) : Q] = 4. This is no ac- 


cident, but rather an instance of a general situation, as we shall see later. A 


The Frobenius Automorphism 


Let F be a finite field. We shall show later that the group of all automorphisms of F 
is cyclic. Now a cyclic group has by definition a generating element, and it may have 
several generating elements. For an abstract cyclic group there is no way of distinguishing 
any one generator as being more important than any other. However, for the cyclic 
group of all automorphisms of a finite field there is a canonical (natural) generator, 
the Frobenius automorphism (classically, the Frobenius substitution). This fact is of 
considerable importance in some advanced work in algebra. The next theorem exhibits 
this Frobenius automorphism. 


Let F be a finite field of characteristic p. Then the map o, : F — F defined by o,(a) = 
a? fora € F isan automorphism, the Frobenius automorphism, of F. Also, Fig,) ~ Zy. 


Let a,b € F. Taking n = 1 in Lemma 33.9, we see that (a + b)? = a? + b?. Thus we 
have 


op(a+b)=(a+ by =a’? +b? =0,(a) +0,(0). 


Of course, 


op(ab) = (ab)? = a?b? = o,(a)op(b), 


SO op is at least ahomomorphism. If c,(a) = 0, then a? = 0, anda = 0, so the kernel of 
op is {0}, and o, is a one-to-one map. Finally, since F is finite, o, is onto, by counting. 
Thus o, is an automorphism of F’. 
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The prime field Z, must be contained (up to isomorphism) in F, since F is of 
characteristic p. For c € Zp, we have o,(c) = c? = c, by Fermat’s theorem (see Corol- 
lary 20.2). Thus the polynomial x? — x has p zeros in F,, namely the elements of Z,. By 
Corollary 23.5, a polynomial of degree n over a field can have at most n zeros in the field. 
Since the elements fixed under o, are precisely the zeros in F of x? — x, we see that 


Lp = Vee ° 


Freshmen in college still sometimes make the error of saying that (a +b)” = 
a" + b", Here we see that this freshman exponentiation, (a + b)? = a? + b? with ex- 
ponent p, is actually valid in a field F of characteristic p. 


@ EXERCISES 48 


Computations 


In Exercises 1 through 8, find all conjugates in C of the given number over the given field. 


1. /2 over Q 2. /2 overR 

3. 3+ 2 over Q 4. /2 — V3 over Q 

5. /2 +i overQ 6. /2 +i overR 

7. V1 ead over Q 8. V1 + V2 over QV/2) 


In Exercises 9 through 14, we consider the field E = Q(./2, V3, V5). It can be shown that [E : Q] = 8. In the 
notation of Theorem 48.3, we have the following conjugation isomorphisms (which are here automorphisms of F): 


v3 QV3, V5))\(V2) > (QV, V5)\-V2), 
vy _ ya (QU, V5)\(V3) > (Q(V2, ¥5))(-V3), 
Wyss: (QU/2, V3)V5) > (QV2, V3)(—V5). 


For shorter notation, let 2 = Wy 4. = Wy and § = Wy _ yf. Compute the indicated element of E. 


9. t(/3) 10. (v2 + V5) 
V2 — 3/5 
LL. (t302)(/2 + 3/5) 12. («573) ( ——~ 
372 (TsTs) 2/3 _J3 
13. (ts?t32)(V2 + V45) 14. [t5(/2 — V3 + (1215)(V/30))] 
15. Referring to Example 48.17, find the following fixed fields in E = Q(./2, V3). 
a. Eto,,03) b. Efex c. Eve,.03} 


In Exercises 16 through 21, refer to the directions for Exercises 9 through 14 and find the fixed field of the 
automorphism or set of automorphisms of E. 


16. 73 Nea 18. {t2, t3} 
19, T5T2 20. T5 7302 21. {T2, 73, Ts} 
22. Refer to the directions for Exercises 9 through 14 for this exercise. 


a. Show that each of the automorphisms 1, t; and ts is of order 2 in G(E/Q). (Remember what is meant by 
the order of an element of a group.) 
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b. Find the subgroup H of G(E/Q) generated by the elements 1, 73, and ts, and give the group table. [Hint: 
There are eight elements. ] 
c. Just as was done in Example 48.17, argue that the group H of part (b) is the full group G(E/Q). 


Concepts 


In Exercises 23 and 24, correct the definition of the italicized term without reference to the text, if correction is 
needed, so that it is in a form acceptable for publication. 


23. 


24. 


25. 


26. 


27. 


28. 


29. 


Two elements, w and , of an algebraic extension E of a field F are conjugate over F if and only if they are 
both zeros of the same polynomial f(x) in F[x]. 


Two elements, a and 8, of an algebraic extension E of a field F are conjugate over F if and only if the 
evaluation homomorphisms ¢, : F[x] > E and dg : F[x] > E have the same kernel. 
The fields Q(./2) and Q(3 + V2) are the same, of course. Let a = 3 + G2, 


a. Find a conjugate 6 4 a of a over Q. 

b. Referring to part (a), compare the conjugation automorphism y5_ 5 of Q(/2) with the conjugation 
automorphism yq_4. 

Describe the value of the Frobenius automorphism a» on each element of the finite field of four elements given 

in Example 29.19. Find the fixed field of o>. 

Describe the value of the Frobenius automorphism o3 on each element of the finite field of nine elements given 

in Exercise 18 of Section 29. Find the fixed field of o3. 

Let F bea field of characteristic p 4 0. Give an example to show that the map o, : F > F givenbyo,(a) = a? 

for a € F need not be an automorphism in the case that F is infinite. What may go wrong? 


Mark each of the following true or false. 


a. For alla, 8 € E, there is always an automorphism of E mapping @ onto f. 

b. For a, 6 algebraic over a field F, there is always an isomorphism of F(a) onto F(B). 

c. Fora, 8 algebraic and conjugate over a field F, there is always an isomorphism of F(a) onto F(B). 
d. Every automorphism of every field E leaves fixed every element of the prime subfield of E. 

e. Every automorphism of every field E leaves fixed an infinite number of elements of E. 

f. Every automorphism of every field E leaves fixed at least two elements of EZ. 


g. Every automorphism of every field E of characteristic 0 leaves fixed an infinite number of elements 
of E. 


h. All automorphisms of a field £ form a group under function composition. 
i. The set of all elements of a field E left fixed by a single automorphism of E forms a subfield of E. 
j- For fields F < E < K, G(K/E) < G(K/F). 


Proof Synopsis 


30. 
31. 


Give a one-sentence synopsis of the “if” part of Theorem 48.3. 


Give a one-sentence synopsis of the “only if” part of Theorem 48.3. 


Theory 


32. 


Let & be algebraic of degree n over F. Show from Corollary 48.5 that there are at most n different isomorphisms 
of F(a) onto a subfield of F and leaving F fixed. 
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33. 


34. 


35. 


36. 


37. 


38. 


39, 
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Let F(a), ---, @,) be an extension field of F. Show that any automorphism o of F(a, +--+, @,) leaving F fixed 
is completely determined by the n values o(c;). 
Let E be an algebraic extension of a field F, and let o be an automorphism of E leaving F fixed. Leta € E. 
Show that o induces a permutation of the set of all zeros of irr(a, F) that are in E. 
Let E be an algebraic extension of a field F. Let S = {o; |i € I} be a collection of automorphisms of F such 
that every o; leaves each element of F fixed. Show that if S generates the subgroup AH of G(E/F), then 
Es =Exu. 
We saw in Corollary 23.17 that the cyclotomic polynomial 

xP—] 


is irreducible over Q for every prime p. Let ¢ be a zero of ©,(x), and consider the field Q(¢). 


a. Show that ¢,¢7,---, ¢?7! are distinct zeros of ® p(x), and conclude that they are all the zeros of ® ,(x). 
b. Deduce from Corollary 48.5 and part (a) of this exercise that G(Q(¢)/Q) is abelian of order p — 1. 
c. Show that the fixed field of G(Q(¢)/Q) is Q. [Hint: Show that 


{f, ae oa 


is a basis for Q(¢) over Q, and consider which linear combinations of ¢, ¢7, ---, ¢?~! are left fixed by all 
elements of G(Q(t)/Q). 


Theorem 48.3 described conjugation isomorphisms for the case where a and 6 were conjugate algebraic ele- 
ments over F.. Is there a similar isomorphism of F(a) with F(8) in the case that w and £ are both transcendental 
over F? 

Let F be a field, and let x be an indeterminate over F. Determine all automorphisms of F(x) leaving F fixed, 
by describing their values on x. 


Prove the following sequence of theorems. 


a. An automorphism of a field E carries elements that are squares of elements in E onto elements that are 
squares of elements of EF. 

b. An automorphism of the field R of real numbers carries positive numbers onto positive numbers. 

c. If o is an automorphism of R anda < b, where a, b € R, then o(a) < o(b). 

d. The only automorphism of R is the identity automorphism. 


Tue ISOMORPHISM EXTENSION THEOREM 


The Extension Theorem 


Let us continue studying automorphisms of fields. In this section and the next, we shall 
be concerned with both the existence and the number of automorphisms of a field E. 
Suppose that F is an algebraic extension of F and that we want to find some au- 
tomorphisms of E. We know from Theorem 48.3 that if a, B € E are conjugate over 
F, then there is an isomorphism wy,g of F(a) onto F(8). Of course, a, 6 ¢ E implies 
both F(a) < E and F(f) < E. It is natural to wonder whether the domain of definition 
of Yq,g can be enlarged from F(a) to a larger field, perhaps all of E, and whether this 
might perhaps lead to an automorphism of E. A mapping diagram of this situation is 
shown in Fig. 49.1. Rather than speak of “enlarging the domain of definition of Wu, g,” it 


49.3 Theorem 
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F F’ 
? =? 
E > E E r= > TE] 
Va 
F(a) ———*° _»> Fg) & > F 
49,1 Figure 49.2 Figure 


is customary to speak of “extending the map ¥,,, to a map 1,” which is a mapping of 
all of E. 

Remember that we are always assuming that all algebraic extension of F under con- 
sideration are contained in a fixed algebraic closure F of F. The Isomorphism Extension 
Theorem shows that the mapping #o,g can indeed always be extended to an isomor- 
phism of E onto a subfield of F. Whether this extension gives an automorphism of E, 
that is, maps E into itself, is a question we shall study in Section 50, Thus this extension 
theorem, used in conjunction with our conjugation isomorphisms yf, g will guarantee the 
existence of lots of isomorphism mappings, at least, for many fields. Extension theorems 
are very important in mathematics, particularly in algebraic and topological situations. 

Let us take a more general look at this situation. Suppose that F is an algebraic 
extension of a field F and that we have an isomorphism o of F onto a field F’. Let F’ 
be an algebraic closure of F’. We would like to extend o to an isomorphism t of E onto 
a subfield of F’. This situation is shown in Fig. 49.2. Naively, we pick a € E but not in 
F and try to extend o to F(q). If 


p(x) = irra, F) = dp + ayx e+ +Gyx", 


let B be a zero in F’ of 


q(x) = o(ap) + o(a)x +--+ + o(G,)x". 


Here g(x) € F’{x]. Since o is an isomorphism, we know that g(x) is irreducible in 
F’{x]. It seems reasonable that F(a@) can be mapped isomorphically onto F’(B) by a 
map extending o and mapping & onto £. (This is not quite Theorem 48.3, but it is close 
to it; a few elements have been renamed by the isomorphism co.) If F(aw) = E, we are 
done. If F(a) # E, we have to find another element in EF not in F(a) and continue the 
process. It is a situation very much like that in the construction of an algebraic closure 
F of a field F. Again the trouble is that, in general, where FE is not a finite extension, 
the process may have to be repeated a (possibly large) infinite number of times, so 
we need Zorn’s lemma to handle it. For this reason, we postpone the general proof of 
Theorem 49.3 to the end of this section. 


(isomorphism Extension Theorem) Let £ be an algebraic extension of a field F. Let 
o be an isomorphism of F onto a field F’. Let F’ be an algebraic closure of F’. Then o 
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49.4 Corollary 


Proof 


49.5 Corollary 


Proof 
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can be extended to an isomorphism t of E onto a subfield of F’ such that t(a) = o(a) 
foralla ¢ F. 


We give as a corollary the existence of an extension of one of our conjugation 
isomorphisms W.,g, as discussed at the start of this section. 


If E < F is an algebraic extension of F and a, 6 € E are conjugate over F, then the 
conjugation isomorphism Wo,g : F(a) > F(B), given by Theorem 48.3, can be extended 
to an isomorphism of £ onto a subfield of F. 


Proof of this corollary is immediate from Theorem 49.3 if in the statement of the theorem 
we replace F by F(a), F’ by F(f), and F’ by F. 5 


As another corollary, we can show, as we promised earlier, that an algebraic closure 
of F is unique, up to an isomorphism leaving F fixed. 


Let F and F’ be two algebraic closures of F. Then F is isomorphic to F’ under an 
isomorphism leaving each element of F fixed. 


By Theorem 49.3, the identity isomorphism of F onto F can be extended to an isomor- 
phism t mapping F onto a subfield of F’ that leaves F fixed (see Fig. 49.6). We need 
only show that t is onto F’. But by Theorem 49.3, the map r~! : r[F] > F can be 
extended to an isomorphism of F’ onto a subfield of F. Since t7} is already onto F, we 


must have t[F] = F’. 5 
F 
F . > LF] 
F : » F 
49.6 Figure 


The Index of a Field Extension 


Having discussed the question of existence, we turn now to the question of how many. For 
a finite extension E of a field F, we would like to count how many isomorphisms there 
are of E onto a subfield of F that leave F fixed. We shall show that there are only a finite 
number of isomorphisms. Since every automorphism in G(E / F)is such an isomorphism. 
a count of these isomorphisms will include all these automorphisms. Example 48.17 
showed that G(QU/2, ¥3)/Q) has four elements, and that 4 = [Q(/2, J3) : Q]. While 
such an equality is not always true, it is true in a very important case. The next theorem 


49,7 Theorem 


Proof 
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takes the first big step in proving this. We state the theorem in more general terms than 
we shall need, but it does not make the proof any harder. 


Let E be a finite extension of a field F. Let o be an isomorphism of F' onto a field 
F’, and let F’ be an algebraic closure of F’. Then the number of extensions of o to an 
isomorphism t of E onto a subfield of F’ is finite, and independent of F’, F’, and o. 
That is, the number of extensions is completely determined by the two fields E and F; 
it is intrinsic to them. 


The diagram in Fig. 49.8 may help us to follow the construction that we are about to 
make. This diagram is constructed in the following way. Consider two isomorphisms 


onto onto 
01: F—> Fi, a2: F —> Fy, 


where F, and Fi are algebraic closures of F] and F,, respectively. Now 020, ‘is an 
isomorphism of Fj onto Fj. Then by Theorem 49.3 and Corollary 49.5 there is an 
isomorphism 


he Fi SS F 
extending this isomorphism 020, ew i = F;. Referring to Fig. 49.8, corresponding to 
each 1; : E — F) that extends o; we obtain an isomorphism 12 : E > F,, by starting 
at E and going first to the left, then up, and then to the right. Written algebraically, 


: To(@) = (AT) )(@) 


fora € E.Clearly t2 extends 0. The fact that we could have started with t2 and recovered 
t, by defining 


T(@) = (A! t2)(@), 


that is, by chasing the other way around the diagram, shows that the correspondence 
between 7, : E > Fi and tT: EF +> Fi is one to one. In view of this one-to-one corre- 
spondence, the number of t extending o is independent of F’, F’ andc. 

‘That the number of mappings extending o is finite follows from the fact that since E is 
a finite extension of F, EF = F(ay,---,@,)forsomea,,---,a, in E, by Theorem 31.11. 


Fi ‘ =I a 
Extends 020; 5 


oy 


x 


+ 7 
alél = Caner oe 


1 2 : 
Fi~< F > Fy 


49.8 Figure 
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49.9 Definition 


49.10 Corollary 


Proof 


49.11 Example 


Proof 


Automorphisms and Galois Theory 


There are only a finite number of possible candiates for the images t(a@;) in F’, for if 
inr(a;, F) = ajo + aX + +++ + dim, x™, 
where a;, < F, then t(a@;) must be one of the zeros in F’ of 


[o (aio) + 0 (@i)x + +++ + OGim, x] € F'Ix]. re 
Let E be a finite extension of a field F. The number of isomorphisms of E onto a subfield 
of F leaving F fixed is the index {E : F} of E over F. | | 


If F < E < K, where K is a finite extension field of the field F, then {K : F} = 
{K : E}{E: F}. 


It follows from Theorem 49.7 that each of the {EZ : F} isomorphisms 1; of E onto a 
subfield of F leaving F fixed has {K : E} extensions to an isomorphism of K onto a 
subfield of F. 


The preceding corollary was really the main thing we were after. Note that it counts 
something. Never underestimate a result that counts something, even if it is only called 
a “corollary.” 

We shall show in Section 51 that unless F is an infinite field of characteristics p 4 0, 
we always have [E : F] = {E : F} for every finite extension field E of F. For the case 
E = F(a), the {F(@) : F} extensions of the identity map: : F > F to maps of F(a) 
onto a subfield of F are given by the conjugation isomorphisms Wu, for each conjugate 
B in F of @ over F. Thus if irr(a, F) has n distinct zeros in F, we have {E : F} =n. 
We shall show later that unless F is infinite and of characteristic p 4 0, the number of 
distinct zeros of irr(a, F’) is deg(a, F) = [F(a): F]. 


Consider EF = v2, 4/3) over Q, as in Example 48.17. Our work in that example shows 

that (E : Q} = [E : Q] =4. Also, {E : Q(V2)} = 2, and {Q(V2) : Q} = 2, so 
4={E: Q={E: QV2)H{QW2): Q = 2). 

This illustrates Corollary 49.10 A 


Proof of the Extension Theorem 


We restate the Isomorphism Extension Theorem 49.3. 


Isomorphism Extension Theorem Let £ be an algebraic extension of a field F. Let 
o be an isomorphism of F onto a field F’. Let F’ be an algebraic closure of F’. Then a 
can be extended to an isomorphism t of E onto a subfield of F’ such that t(@) = a(a) 
fora € F. 


Consider all pairs (Z, A), where L is a field such that F < L < E and A is an isomor- 
phism of L onto a subfield of F’ such that A(a) = o(a) for a € F. The set S of such 
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pairs (L, A) is nonempty, since (F,, 0) is such a pair. Define a partial ordering on S' by 
(Li, a1) S (£2, Ao), if Li < Lo and Aj(a) = Ad(a) for a € Lj. It is readily checked that 
this relation < does give a partial ordering of S. 

Let T = {(H;, 4;)|i € 2} be achain of S. We claim that H = UJ,., Hj is a subfield 
of E. Leta, b € H, where a € Hj and b € A; then either H, < Hy or Hy < Aj, since 
T is achain. If, say, H, < A, then a,b, € Ho,soa +b, ab, and a/b for b £0 are all 
in H and hence in A. Since for eachi ¢ J, F C H; C E, wehave F C A CE. Thus 
H isa subfield of E. 

Define A: H — F’ as follows. Letc € H. Thenc € H; for somei € I, and let 


A(c) = Ai(c). 


The map A is well defined because if c € H, andc € Ab, then either (H,, 41) < (Ab, A) 
or (ff, 42) < (Ai, 44), since T is a chain. In either case, A, (c) = A2(c). We claim that A 
is an isomorphism of H onto a subfield of F’. If a, b € H then there is an H; such that 
a,b € H,, and 


Ma + b) = Aila + b) = Ai@) + Ad) = AG) + A). 
Similarly, 
Mab) = 4;(ab) = Aj(@)A,(b) = A(a)A(D). 
If A(a) = 0, then a € H; for some i implies that A;(a) = 0, so a = 0. Therefore, A is 
an isomorphism. Thus (H, 4) € S, and it is clear from our definitions of H and A that 
(H, i) is an upper bound for T. 

We have shown that every chain of S has an upper bound in S, so the hypotheses 
of Zorn’s lemma are satisfied. Hence there exists a maximal element (K, rt) of S. Let 
t(K) = K’, where K’ < F’. Now if K # E, leta ¢ E buta ¢ K. Now a is algebraic 
over F, so @ is algebraic over K. Also, let p(x) = itr(a, K). Let wy be the canonical 
isomorphism 

Wa : K[x]/{p(x)) > Kt@), 
corresponding to the evaluation homomorphism ¢, : K[x] > K(q). If 
DX) = ag + ax + +--+ yx", 
consider 


q(x) = Tao) + T(ay)x + +++ + Tay) x" 


in K’[x]. Since t is an 1 isomorphism, g(x) is irreducible in K’[x]. Since K’ < F’, there 
is a zero a’ of g(x) in F’. Let 


Va : K'[x]/(q(x)) > Ke’) 
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Kt] ——2—> xt 
Canonical Canonical 
“4 K—_1__> x’ v' 
Wo = ra Var ye 
K() <—— Ka] /(p@)) Foyty) > K'[x] (px) ———> B'@) 
49.12 Figure 


be the isomorphism analogous to wy. Finally, let 
t : K[x]/(p@)) > K'[x]/(q@)) 


be the isomorphism extending t on K and mapping x + (p(x)) onto x + (q(x)). See 
Fig. 49.12.) Then the composition of maps 


vat, ) + Ka) > K'@) 


is an isomorphism of K (a) onto a subfield of F’. Clearly, (K, 7) < (K(@), Wat; 
which contradicts that (K, t) is maximal. Therefore we must have had K = E. Sd 
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Computations 


Let E = Q(/2, V3, 5). It can be shown that [E : Q] = 8. In Exercises 1 through 3, for the given isomorphic 
mapping of a subfield of £, give all extensions of the mapping to an isomorphic mapping of E onto a subfield of 
©. Describe the extensions by giving values on the generating set {/2, /3, /5} for E over Q. 


le: Qv2, 15) > Qv2, »/15), where « is the identity map 
2. 6 : Q/2, V15) > Q(V2, VTS) where o(/2) = V2 and o(/15) = —V15 
3. Wm, ym | QWV30) > QV30) 


It is a fact, which we can verify by cubing, that the zeros of x? — 2 in Q are 
ay - /2, pee and a3 ye 
where «/2, as usual, is the real cube root of 2. Use this information in Exercises 4 through 6. 
4. Describe all extensions of the identity map of Q to an isomorphism mapping Q(./2) onto a subfield of Q. 
5. Describe all extensions of the identity map of Q to an isomorphism mapping Q(/2, V3) onto a subfield of Q. 
6. Describe all extensions of the automorphism yz 4 of Q(/3) to an isomorphism mapping Q(i, V3, ¥/2) 
onto a subfield of Q. 
7. Let o be the automorphism of Q() that maps 2 onto —z. 
a. Describe the fixed field of o. 
b. Describe all extensions of o to an isomorphism mapping the field Q(./7) onto a subfield of Q(z). 
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Concepts 


8. 


Mark each of the following true or false. 


a. Let F(a) be any simple extension of a field F. Then every isomorphism of F onto a subfield of F 
has an extension to an isomorphism of F(q@) onto a subfield of F. 

b. Let F(@) be any simple algebraic extension of a field F. Then every isomorphism of F' onto a 
subfield of F has an extension to an isomorphism of F(a) onto a subfield of F. 

c. An isomorphism of F onto a subfield of F has the same number of extensions to each simple 
algebraic extension of F’. 

__ d. Algebraic closures of isomorphic fields are always isomorphic. 

. Algebraic closures of fields that are not isomorphic are never isomorphic. 

____ f. Any algebraic closure of Q(./2) is isomorphic to any algebraic closure of Q/17). 

. The index of a finite extension E over a field F is finite. 

. The index behaves multiplicatively with respect to finite towers of finite extensions of fields. 


. Our remarks prior to the first statement of Theorem 49.3 essentially constitute a proof of this 
theorem for a finite extension E over F. 


o 


me OG 


j. Corollary 49.5 shows that C is isomorphic to @. 


Theory 


9. 


10. 


11. 


12. 


13. 


Let K be an algebraically closed field. Show that every isomorphism o of K onto a subfield of itself such that 
K is algebraic over o[K] is an automorphism of K, that is, is an onto map. [Hint: Apply Theorem 49.3 to 0—!.] 


Let E be an algebraic extension of a field F. Show that every isomorphism of E onto a subfield of F leaving 
F fixed can be extended to an automorphism of F. 


Prove that if Z is an algebraic extension of a field F, then two algebraic closures F and E of F and E, 
respectively, are isomorphic. 

Prove that the algebraic closure of Q(,/7) in C is isomorphic to any algebraic closure of Q(x), where Q is the 
field of algebraic numbers and x is an indeterminate. 

Prove that if E is a finite extension of a field F, then {F : F} < LE: F]. |Hint: The remarks preceding 
Example 49.11 essentially showed this for a simple algebraic extension F(a) of F. Use the fact that a finite 
extension is a tower of simple extensions, together with the multiplicative properties of the index and degree.] 


SPLITTING FIELDS 


We are going to be interested chiefly in automorphisms of a field E, rather than mere 
isomorphic mappings of E onto a subfield of E. It is the automorphisms ofa field that form 
a group. We wonder whether for some extension field £ of a field F, every isomorphic 
mapping of E onto a subfield of F leaving F fixed is actually an automorphism of E. 

Suppose E is an algebraic extension of a field E. If w € E and B € Fis a conjugate 
of a over F, then there is a conjugation isomorphism 


Va,p : F(a) > F(B). 


By Corollary 49.4, Wa, can be extended to an isomorphic mapping of E onto a subfield 
of F. Now if 6 ¢ E, such an isomorphic mapping of E can’t be an automorphism of E£. 
Thus, if an algebraic extension E of afield F is such that all its isomorphic mappings onto 
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Part X 


50.1 Definition 


50.2 Example 


50.3 Theorem 


Proof 


Automorphisms and Galois Theory 


a subfield of F leaving F fixed are actually automorphisms of E, then for every a € E, 
all conjugates of over F must be in E also. This observation seemed to come very 
easily. We point out that we used a lot of power, namely the existence of the conjugation 
isomorphisms and the Isomorphism Extension Theorem 49.3. 

These ideas suggest the formulation of the following definition. 


Let F bea field with algebraic closure F. Let { f;(x) |i € I} be acollection of polynomials 
in F[x]. A field E < Fis the splitting field of { f(x) |i ¢ 7} over F if E is the smallest 
subfield of F containing F and all the zeros in F of each of the f;(x) fori € I. A field 
K < Fis asplitting field over F if itis the splitting field of some set of polynomials in 
F{x]. a 


We see that Q[/2, V3] isa splitting field of {x? — 2, x? — 3} and also of {x* — 5x? + 6}. 
A 


For one polynomial f(x) € F [x], we shall often refer to the splitting field of { f(x)} 
over F as the splitting field of f(x) over F’. Note that the splitting field of {f;@) |i € I} 
over F in F is the intersection of all subfields of F containing F and all zeros in F of 
each f;(x) fori € J. Thus such a splitting field surely does exist. 

We now show that splitting fields over F are precisely those fields E < F with the 
property that all isomorphic mappings of E onto a subfield of F leaving F fixed are 
automorphisms of EF. This will be a corollary of the next theorem. Once more, we are 
characterizing a concept in terms of mappings. Remember, we are always assuming 
that all algebraic extensions of a field F under consideration are in one fixed algebraic 
élosure F of F. 


Afield Z, where F < E< F, isasplitting field over F if and only if every automorphism 
of F leaving F fixed maps E onto itself and thus induces an automorphism of E leaving 
F fixed. 


Let E be a splitting field over F in F of { f(x) |i € I}, and let o be an automorphism of 
F leaving F fixed. Let {a i 1d € J} be the collection of all zeros in F of all the f;(x) for 
i € I. Now our previous work shows that for a fixed a;, the field F(a@;) has as elements 
all expressions of the form 
g(oj) = a9 tajaj ++ +ay,-107 

where n; is the degree of irr(a,;, F’) and a, € F. Consider the set S of all finite sums of 
finite products of elements of the form g(q,;) for all 7 € /. The set S is a subset of E 
closed under addition and multiplication and containing 0, 1, and the additive inverse 
of each element. Since each element of S is in some F(a;,,---, aj) C S, we see that § 
also contains the multiplicative inverse of each nonzero element. Thus S is a subfield of 
E containing all a; for 7 € J. By definition of the splitting field Z of {f;() |i € I}, we 
see that we must have S = E. All this work was just to show that {a,; | j € J} generates 
E over F, in the sense of taking finite sums and finite products. Knowing this, we 
see immediately that the value of o on any element of E is completely determined by 
the values o(a@;). But by Corollary 48.5, o(@;) must also be a zero of irr(a;, F). By 


50.4 Definition 


50.5 Example 


50.6 Corollary 


Proof 


50.7 Corollary 


Proof 
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Theorem 29.13, irr(@;, F) divides the f;(x) for which f;(@;) = 0, so o(a,;) € E also. 
Thus o maps F onto a subfield of E isomorphically. However, the same is true of the 
automorphism a7! of F. Since for B € E, 


B=o(o '()), 


we see that o maps E onto £, and thus induces an automorphism of E. 

Suppose, conversely, that every automorphism of F leaving F fixed induces an 
automorphism of E. Let g(x) be an irreducible polynomial in F [x] having a zero a in E. 
If 6 is any zero of g(x) in F, then by Theorem 48.3, there is a conjugation isomorphism 
Wa,p of F(x) onto F(B) leaving F fixed. By Theorem 49.3, W.,g can be extended to an 
isomorphism t of F onto a subfield of F. But then 


t+:t[F] > F 


can be extended to an isomorphism mapping F onto a subfield of F. Since the image of 
| is already all of F, we see that r must have been onto F, so t is an automorphism of 
F leaving F fixed. Then by assumption, t induces an automorphism of E, so t(@) = B 
isin E. We have shown that if g(x) is an irreducible polynomial in F [x] having one zero 
in E, then all zeros of g(x) in F are in E. Hence if {g;(x)} is the set of all irreducible 
polynomials in F[x] having a zero in E, then £ is the splitting field of {g,(x)}- 


Let E be an extension field of a field F. A polynomial f(x) € F[x] splits in £ if it 
factors into a product of linear factors in E[x]. a 


The polynomial x*—5x?+6 in Q[x] splits in the field Q[./2, V3] into 
(x —J/ 2) + VS 2)(x — V3)(x + V3). A 


If E < Fisa splitting field over F, then every irreducible polynomial in F[x] having a 
zero in E& splits in £. 


If E is a splitting field over F in F, then every automorphism of F induces an automor- 
phism of E. The second half of the proof of Theorem 50.3 showed precisely that F is 
also the splitting field over F of the set {g;(x)} of all irreducible polynomials in F[x] 
having a zero in £. Thus an irreducible polynomial f(x) of F[x] having a zero in E has 
all its zeros in F in E. Therefore, its factorization into linear factors in F[x]j, given by 
Theorem 31.15, actually takes place in E[x], so f(x) splits in EZ. . 


TE < Fisa splitting field over F, then every isomorphic mapping of E onto a subfield 
of F and leaving F fixed is actually an automorphism of £. In particular, if EF is a splitting 
field of finite degree over F’, then 


{E: F} = |G(E/F)|. 


Every isomorphism o mapping E onto a subfield of F leaving F fixed can be extended 
to an automorphism t of F, by Theorem 49.3, together with the onto argument of the 
second half of the proof of Theorem 50.3. If £ is a splitting field over F’, then by Theo- 
rem 50.3, t restricted to F, that is o, is an automorphism of EF. Thus for a splitting field 
E over F, every isomorphic mapping of E onto a subfield of F leaving F fixed is an 
automorphism of E. 
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The equation {E : F} = |G(E/F)| then follows immediately for a splitting field E 
of finite degree over F, since {E : F} was defined as the number of different isomorphic 
mappings of E onto a subfield of F leaving F fixed. ¢ 


50.8 Example Observe that Q(/2, /3) is the splitting field of 
PS 2 o8? S33} 


over Q. Example 48.17 showed that the mappings ¢, 01, 02, and o3 are all the automor- 
phisms of Q/2, V3) leaving Q fixed. (Actually, since every automorphism of a field 
rust leave the prime subfield fixed, we see that these are the only automorphisms of 


Qv/2, V3).) Then 
(Q(V2, V3) : Q} = |G(Q(V2, ¥3)/Q] = 4. 
illustrating Corollary 50.7. A 


We wish to determine conditions under which 
|G(E/F)|={£: FJ) =[E: F] 
for finite extensions E of F. This is our next topic. We shall show in the following section 
that this equation always holds when £ is a splitting field over a field F of characteristic 


0 or when F is a finite field. This equation need not be true when F is an infinite field 
of characteristic p # 0. 


50.9 Example Let 3/2 be the real cube root of 2, as usual. Now x? — 2 does not split in Q/2), for 
Q(</2) < R and only one zero of x3 — 2 is real. Thus x° — 2 factors in (Q(/2))[x] into 
a linear factor x — </2 and an irreducible quadratic factor. The splitting field E ofx?-2 | 
over Q is therefore of degree 2 over Q(./2). Then 


[E: Q=LE : QW/2)1QW/2) : QI = (2)(3) = 6. 


We have shown that the splitting field over Q of x* — 2 is of degree 6 over Q. 
We can verify by cubing that 


poe sie and foe ae 
2 2 
are the other zeros of x? —2 in C. Thus the splitting field E of x? —2 over Q is 
Q(./2, i/3). (This is not the same field as Q(./2, i, /3), which is of degree 12 over Q.) 
Further study of this interesting example is left to the exercises (see Exercises 7, 8, 9, 
16, 21, and 23). rN 


f@ EXERCISES 50 


Computations 

In Exercises | through 6, find the degree over Q of the splitting field over Q of the given polynomial in Q[x]. 
1. x7 +3 2x71 3. (x? — 2)(x? — 3) 
4. x3 -3 5, x°—1 6. (x? — 2)(x3 — 2) 
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Refer to Example 50.9 for Exercises 7 through 9. 


7. 
8. 
9. 
10. 


What is the order of G(Q(/2)/Q)? 
What is the order of G(QC/2, iV3)/Q)? 
What is the order of G(QC/2, iV¥3)/QW/2))? 


Let w be a zero of x7 + x? + 1 over Zp. Show that x? + x? + 1 splits in Z(a). [Hint: There are eight elements 
in Z2(a). Exhibit two more zeros of x3 + x? + 1, in addition to a, among these eight elements. Alternatively, 
use the results of Section 33.] 


Concepts 


In Exercises 11 and 12, correct the definition of the italicized term without reference to the text, if correction is 
needed, so that it is in a form acceptable for publication. 


11. 


12 


13. 


14. 


15, 
16. 


Let F < E < F where Fis an algebraic closure of a field F. The field E is a splitting field over F if and only 

if E contains all the zeros in F of every polynomial in F[x] that has a zero in E. 

A polynomial f(x) in F[x] splits in an extension field E of F if and only if it factors in E[x] into a product of 

polynomials of lower degree. 

Let f(x) be a polynomial in F[x] of degree n. Let E < F be the splitting field of f(x) over F in F, What 

bounds can be put on [E : F]? 

Mark each of the following true or false. 

a. Leta, B € E, where E < Fisa splitting field over F. Then there exists an automorphism of F 
leaving F fixed and mapping @ onto # if and only if irr(a, F) = irr(f, F). 

—__— _b. Ris a splitting field over Q. 

. Ris a splitting field over R. 

sd. Cisa splitting field over R. 

e. Q(é) is a splitting field over Q. 

____ f. Q(z) is a splitting field over Q(z’). 

_______ g, For every splitting field E over F, where E < F, every isomorphic mapping of E is an automor- 

phism of FE. 


——— h. For every splitting field E over F, where E < F, every isomorphism mapping E onto a subfield 
of Fis an automorphism of E. 


oO 


i. For every splitting field E over F, where E < F, every isomorphism mapping E onto a subfield 
of F and leaving F fixed is an automorphism of EF. 


j. Every algebraic closure F of a field F is a splitting field over F. 


Show by an example that Corollary 50.6 is no longer true if the word irreducible is deleted. 
a. Is |G(E/F)| multiplicative for finite towers of finite extensions, that is, is 
|G(E/F)| = |G(K/E)||G(E/F)| for F<E<K<F) 


Why or why not? (Hint: Use Exercises 7 through 9.] 
b. Is |G(Z/F)| multiplicative for finite towers of finite extensions, each of which is a splitting field over the 
bottom field? Why or why not? 


Theory 


17. 


Show that if a finite extension & of a field F is a splitting field over F, then E is a splitting field of one 
polynomial in F[x]. 
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18. 
19. 


20. 
21. 


22. 


23. 


24. 


25. 


Part X Automorphisms and Galois Theory 


Show that if LE : F] = 2, then E is a splitting field over F. 

Show that for F < E < F, E isa splitting field over F if and only if E contains all conjugates over F in F for 
each of its elements. 

Show that Q(./2) has only the identity automorphism. 

Referring to Example 50.9, show that 


G(QW2, iV3)/Q-V3)) = (Zs, +). 


a. Show that an automorphism of a splitting field E over F of a polynomial f(x) € F[x] permutes the zeros 
of f(x) in E. 

b. Show that an automorphism of a splitting field E over F of a polynomial f(x) € F[x] is completely 
determined by the permutation of the zeros of f(x) in E given in part (a). 

c. Show that if F is a splitting field over F of a polynomial f(x) € FLx], then G(E/F) can be viewed in a 
natural way as a certain group of permutations. 


Let E be the splitting field of x? — 2 over Q, as in Example 50.9. 


a. What is the order of G(E/Q)? [Hint: Use Corollary 50.7 and Corollary 49.4 applied to the tower Q < 
QU v3) < E.] 

b. Show that G(Z/Q) = S3, the symmetric group on three letters. [Hint: Use Exercise 22, together with part 
(a).] 

Show that for a prime p, the splitting field over Q of x? — 1 is of degree p — 1 over Q. [Hint: Refer to 

Corollary 23.17.] 


Let F and F’ be two algebraic closures of a field F, and let f(x) € F[x]. Show that the splitting field E over 
F of f(x) in F is isomorphic to the splitting field E’ over F of f(x) in F’. [Hint: Use Corollary 49.5.] 


SEPARABLE EXTENSIONS 


Multiplicity of Zeros of a Polynomial 


Remember that we are now always assuming that all algebraic extensions of a field F 
under consideration are contained in one fixed algebraic closure F of F. 

Our next aim is to determine, for a finite extension FE of F, under what conditions 
{E: F} =[E: F]. The key to answering this question is to consider the multiplicity of 
zeros of polynomials. 


51.1 Definition Let f(x) € F[x]. Anelement a of F such that f(a) = 0 is azero of f (x) of multiplicity 


v if v is the greatest integer such that (x — a)” is a factor of f(x) in Fx]. a 


The next theorem shows that the multiplicities of the zeros of one given irreducible 
polynomial over a field are all the same. The ease with which we can prove this theorem 
is a further indication of the power of our conjugation isomorphisms and of our whole 
approach to the study of zeros of polynomials by means of mappings. 


51.2 Theorem Let f(x) be irreducible in F[x]. Then all zeros of f(x) in F have the same multiplicity. 
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Proof Leta and f be zeros of f(x) in F. Then by Theorem 48.3, there is a conjugation isomor- 
phism Wo,g : F (aS F(B). By Corollary 49.4, Wa, can be extended to an isomorphism 
t : & — F. Then t induces a natural isomorphism t, : F[x] > F[x], with t,(x) = x. 


Now t, leaves f(x) fixed, since f(x) € F[x] and yo, leaves F fixed. However, 
T(x — @)”) = (x — B)’, 


which shows that the multiplicity of 6 in f(x) is greater than or equal to the multiplicity 
of a. A symmetric argument gives the reverse inequality, so the multiplicity of a equals 
that of 8. ¢ 


51.3 Corollary If f(x) is irreducible in F [x], then f(x) has a factorization in F [x] of the form 
a [o-a”, 
i 


where the a; are the distinct zeros of f(x) in F anda € F. 


Proof The corollary is immediate from Theorem 51.2. ¢ 


At this point, we should probably show by an example that the phenomenon of 
a zero of multiplicity greater than 1 of an irreducible polynomial can occur. We shall 
show later in this section that it can only occur for a polynomial over an infinite field of 
characteristic p 4 0. 


51.4 Example Let E = Z,(y), where y is an indeterminate. Lett = y?, andlet F be the subfield Z,(t) of 
E. (See Fig. 51.5.) Now £ = F(y) is algebraic over F, for y is a zero of (x? — 1) € F[x]. 
By Theorem 29.13, irr(y, F) must divide x? — ¢ in F[x]. [Actually, irr(y, F) = x? — 1. 
We leave a proof of this to the exercises (see Exercise 10).] Since F(y) is not equal to 
F, we must have the degree of irr(y, F’) > 2. But note that 
E=Z,() = Fo) 
AP SES AP ye yr, 


since E has characteristic p (see Theorem 48.19 and the following comment). Thus y is 
F =Z,(t) =Z,(?) a zero of irr(y, F) of multiplicity > 1. Actually, x? — ¢ = irr(y, F), so the multiplicity 
of y is p. A 


From here on we rely heavily on Theorem 49.7 and its corollary. Theorem 48.3 and 
, its corollary show that for a simple algebraic extension F(a) of F there is one extension 
P of the identity isomorphism ¢ mapping F into F for every distinct zero of irr(a, F) and 
51.5 Figure that these are the only extensions of 1. Thus {F (a) : F} is the number of distinct zeros of 
irr(a, F). . 
In view of our work with the theorem of Lagrange and Theorem 31.4, we should 
recognize the potential of a theorem like this next one. 


51.6 Theorem If E isa finite extension of F, then {FE : F} divides [E : F}. 
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Part X 


Proof 


51.7 Definition 


51.8 Example 


51.9 Theorem 


Proof 


Automorphisms and Galois Theory 


By Theorem 31.11, if E is finite over F’, then E = F(q1,---,a@,), where a; € F. Let 
irr(a;, F(a, +++, @;~,)) have a; as one of n; distinct zeros that are all of a common 
multiplicity v;, by Theorem 51.2. Then 


[F(ay, +++, aj): F(oy,---, oj) = ni; = {F(o1, +++, oi): PQ, +++, a}. 
By Theorem 31.4 and Corollary 49.10, 
[E: F] =] ]am, 
and 
{E: F}=| ni. 
Therefore, {E : F} divides [EZ : F]. S 


Separable Extensions 


A finite extension E of F is a separable extension of F if {E: F} =[£: F). An 
element a of F is separable over F if F(a) is a separable extension of F’. An irreducible 
polynomial f(x) € F[x] is separable over F if every zero of f(x) in F is separable 


over F’. | 
The field E = Q[V2, V3] is separable over Q since we saw in Example 50.8 that 
{E:Q)=4=[E: QI. A 


* To make things a little easier, we have restricted our definition of a separable exten- 
sion of a field F to finite extensions E of F. For the corresponding definition for infinite 
extensions, see Exercise 12. 

We know that {F(a): F} is the number of distinct zeros of irr(@, F). Also, the 
multiplicity of w in irr(@, F) is the same as the multiplicity of each conjugate of @ over 
F, by Theorem 51.2. Thus a is separable over F if and only if irr(a, F) has all zeros 
of multiplicity 1. This tells us at once that an irreducible polynomial f(x) € F[x] is 
separable over F if and only if f(x) has all zeros of multiplicity I. 


If K is a finite extension of E and E is a finite extension of F, thatis, F < E < K, then 
K is separable over F if and only if K is separable over E and E is separable over F’. 
Now 

[K: F]=[K: E][E: F], 
and 

{K: FP} ={kK: E}{E: F}. 


Then if K is separable over F, so that [K : F] = {K : F}, we musi have [K : E] = 
{K : E} and [EZ : F] ={E: F}, since in each case the index divides the degree, by 
Theorem 51.6. Thus, if K is separable over F, then K is separable over FE and E is 
separable over F. 


51.10 Corollary 


Proof 


51.11 Lemma 


Proof 
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For the converse, note that[K : E] ={K: E}and[E: F] ={E : F} imply that 
[K: FJ =[K: EE: FJ) ={K: EYE: FP} ={(K: F}. ¢ 


Theorem 51.9 can be extended in the obvious way, by induction, to any finite tower 
of finite extensions. The top field is a separable extension of the bottom one if and only 
if each field is a separable extension of the one immediately under it. 


If E is a finite extension of F, then E is separable over F if and only if each a in E is 
separable over F. 


Suppose that E is separable over F’, and leta ¢ EF. Then 
F<eF@)<&, 


and Theorem 51.9 shows that F(a) is separable over F. 
Suppose, conversely, that every aw € E is separable over F. Since £ is a finite 
extension of F, there exist a1, ---, @, such that 


F< F(a;) < F(a, 02) <-++ < E= F(a), +-+,Qy). 
Now since a; is separable over F, a; is separable over F(a, ---, @j—1), because 


q(x) = ira, F(@1, +++, a1) 


divides irr(a@;, F), so that a; is a zero of g(x) of multiplicity 1. Thus F(a, ---, a) is 
separable over F(a 1,---, aj), so E is separable over F by Theorem 51.9, extended 
by induction. ¢ 


Perfect Fields 


We now turn to the task of proving that a can fail to be separable over F only if F is 
an infinite field of characteristic p # 0. One method is to introduce formal derivatives 
of polynomials. While this is an elegant technique, and also a useful one, we shall, for 
the sake of brevity, use the following lemma instead. Formal derivatives are developed 
in Exercises 15 through 22. 


Let F be an algebraic closure of F, and let 
FR) =x" Fay ax” | +---+ax+a9 


be any monic polynomial in F[x]. If (f(x)y” € F[x] andm-1+40in F, then f(x) € 
F [x], that is, all a; € F. 


We must show that a; € F, and we proceed, by induction on r, to show that a,_, € F. 
Forr = 1, 


(FQ) Sa On agg ea 
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Part X 


§1.12 Definition 
51.13 Theorem 


Proof 


51.14 Theorem 


Proof 


Automorphisms and Galois Theory 


Since (f(x))” € F[x], we have, in particular, 
(m+ 1)dy-1 € F. 
Thus d,_) € F, sincem- 14 0in F. 


As induction hypothesis, suppose that a,_, € F forr = 1, 2,---,k. Then the coef- 
ficient of x”"-"+) in (f(x))™ is of the form 


(m » W)an—certy + Bk41(Gn—1s Gn—2, °° +, Unk) 
where gx11(@a_1; Gn_2,°++, Gn—g) is a formal polynomial expression in @j_1, Gn—2,°-°, 
dn _—z. By the induction hypothesis that we just stated, 2441(@p-1, Gn—2,°**, Gn-k) € FP, 
80 Gn_(k41) € F, sincem-1 AOin F. ¢ 


We are now in a position to handle fields F of characteristic zero and to show that 
for a finite extension E of F, we have {E : F} = [E : F). By definition, this amounts to 
proving that every finite extension of a field of characteristic zero is a separable extension. 
First, we give a definition. 


A field is perfect if every finite extension is a separable extension. a 
Every field of characteristic zero is perfect. 


Let E be a finite extension of a field F of characteristic zero, and let wa € E. Then 
f(x) = irr(a, F) factors in F[x] into [],(~ — @;)”, where the a; are the distinct zeros of 
irréy, F), and, say, a = a1. Thus 


f@= ( [Ie - x) 


and since v - 1 40 fora field F of characteristic 0, we must have 


(Te = a) é Fix] 


by Lemma 51.11. Since f(x) is irreducible and of minimal degree in F[x] having @ 
as a zero, we then see that v = 1. Therefore, a is separable over F for alla € E. By 
Corollary 51.10, this means that E is a separable extension of F. ¢ 


Lemma 54.11 will also get us through for the case of a finite field, although the 
proof is a bit harder. 


Every finite field is perfect. 


Let F be a finite field of characteristic p, and let E bea finite extension of F. Let w <€ E. 
We need to show that a is separable over F. Now f(x) = irr(a, F) factors in F into 
[],(« — ;)”, where the @; are the distinct zeros of f(x), and, say, @ = a. Let v = ple, 


§1,.15 Theorem 


Proof 
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where p does not divide e. Then 


fa)=[]a-a)’= (Tle - a") 


is in F[x]}, and by Lemma 54.11, J],(« — a@;)” is in F[x] since e- 1 ~ Oin F. Since 
f(x) = irra, F) is of minimal degree over F having a as a zero, we must have e = 1. 
Theorem 48.19 and the remark following it show then that 


fa)= [Ie = aj)" = I] (x? — a"), 


Thus, if we regard f(x) as Bx? ), we must have g(xye F[x]. Now g(x) 1 is separable 
over F with distinct zeros a? . Consider F (a? ‘= = F(a’). Then F(a” y is separable 
over F. Since x? —a? = Ge — a)’, we see that w is the only zero of x?° — a” in F. 
As a finite-dimensional vector space over a finite field F, F(«”') must be again a finite 
field. Hence the map 


Opt F(a?) > F(a?) 


given by o,(a) = a? fora ¢ F (w?’) is an automorphism of F(a’) by Theorem 48.19. 
Consequently, (a,)' is also an automorphism of F (a?’), and 


(op) (a) =a”. 


Since an automorphism of F (a?) is an onto map, there is B € F (a?’) such that 
(o,)'(B) =a". But then £2’ = w?’, and we saw that w was the only zero of xP — a?’ , 
so we must have 6 = a. Since B € F(a”), we have F(a) = F(a’). Since F(a?') was 
separable over F’, we now see that F(a) is separable over F. Therefore, a is separable 
over F andt = 0. 

We have shown that fora € E, a is separable over F. Then by Corollary 51.10, E 
is a separable extension of F. Ad 


We have completed our aim, which was to show that fields of characteristic 0 and 
finite fields have only separable finite extensions, that is, these fields are perfect. For 
finite extensions E of such perfect fields F, we then have [E : F]) = {E: F}. 


The Primitive Element Theorem 
The following theorem is a classic of field theory. 
(Primitive Element Theorem) Let £ be a finite separable extension of a field F. Then 


there exists a € E such that F = F(a). (Such an element a is a primitive element.) 
That is, a finite separable extension of a field is a simple extension. 


If F is a finite field, then F is also finite. Let a be a generator for the cylic group E* of 
nonzero elements of F under multiplication. (See Theorem 33.5.) Clearly, E = F(a), 
So @ is a primitive element in this case. 
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We now assume that F is infinite, and prove our theorem in the case that EF = 
F(B, y). The induction argument from this to the general case is straightforward. Let 
inr(f, F’) have distinct zeros B = B,,---, By, and let irr(y, F) have distinct zeros y = 
Yi, **+s Ym in F, where all zeros are of multiplicity 1, since E is a separable extension 
of F. Since F is infinite, we can find a € F such that 


aF (Bi — BY/(y — ys) 


for alli and j, with j # 1. That is, a(y — y;) ¢ 6; — B. Letting a = B + ay, we have 
a=B+ay # B + ayj;, 80 


a — ay; # B; 
for alli and all j # 1. Let f(x) = r(6, F), and consider 
h(x) = f(a — ax) € (F(@))[4]. 


Now hA(y) = f(B) = 0. However, h(y;) #0 for j #1 by construction, since the f; 
were the only zeros of f(x). Hence h(x) and g(x) = irr(y, F) have a common factor 
in (F(q@))[x], namely irr(y, F(«)), which must be linear, since y is the only common 
zero of g(x) and A(x). Thus y € F(q@), and therefore 8 = w — ay is in F(a). Hence 
F(B, vy) = F(@). ¢ 


51.16 Corollary A finite extension of a field of characteristic zero is a simple extension. 


Proof This corollary follows at once from Theorems 51.13 and 51.15. ¢ 


We see that the only possible “‘bad case” where a finite extension may not be simple 
is a finite extension of an infinite field of characteristic p 4 0. 


@ EXERCISES 51 


Computations 


In Exercises 1 through 4, find a such that the given field is Q(a). Show that your @ is indeed in the given field. 
Verify by direct computation that the given generators for the extension of Q can indeed be expressed as formal 
polynomials in your @ with coefficients in Q. 


1. QV2, 72) 2. Q0/2, </2) 
3. QV2, V3) 4, QU, V2) 
Concepts 


In Exercises 5 and 6, correct the definition of the italicized term without reference to the text, if correction is needed, 
so that it is in a form acceptable for publication. 


5. Let F be an algebraic closure of a field F. The multiplicity of a zero a € F of a polynomial f(x) € F[x] is 
v € Zt if and only if (x — a)” is the highest power of x — o that is a factor of f(x) in F[x]. 


6. Let F be an algebraic closure of a field F. An element a in F is separable over F if and only if a is a zero of 
multiplicity 1 of irr(@, F). 
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7, Give an example of an f(x) € Q[x] that has no zeros in Q but whose zeros in C are all of multiplicity 2. 
Explain how this is consistent with Theorem 51.13, which shows that Q is perfect. 
8. Mark each of the following true or false. 
a. Every finite extension of every field F is separable over F. 
b. Every finite extension of every finite field F is separable over F. 
c. Every field of characteristic 0 is perfect. 
d. Every polynomial of degree n over every field F always has n distinct zeros in F. 
e. Every polynomial of degree n over every perfect field F always has n distinct zeros in F. 
f. Every irreducible polynomial of degree n over every perfect field F always has n distinct zeros 
in F. 
g. Every algebraically closed field is perfect. 
h. Every field F has an algebraic extension E that is perfect. 
i. If E is a finite separable splitting field extension of F, then |G(E/F)| =[E: F]. 
j. If Z is a finite splitting field extension of F, then |G(E/F)} divides [EZ : F]. 
Theory 
9, Show that if w, 6 € F are both separable over F, then a + £, wf, and a/8, if 8 4 0, are all separable over F. 
{Hint: Use Theorem 51.9 and its corollary. ] 
10. Show that {1, y,---, y?~!} is a basis for Z,(y) over Z,(y?), where y is an indeterminate. Referring to Exam- 


11. 
12. 


13. 


14. 


15. 


ple 51.4, conclude by a degree argument that x? — 7 is irreducible over Z(t), where t = y?. 
Prove that if E is an algebraic extension of a perfect field F, then E is perfect. 


A (possibly infinite) algebraic extension E ofa field F is a separable extension of F if for every a € E, F(a) 
is a separable extension of F’, in the sense defined in the text. Show that if E is a (possibly infinite) separable 
extension of F and K is a (possibly infinite) separable extension of E, then K is a separable extension of F. 


Let £ be an algebraic extension of a field F. Show that the set of all elements in E that are separable over F 
forms a subfield of FE, the separable closure of F in E. [Hint: Use Exercise 9.] 


Let E be a finite field of order p”. 
a. Show that the Frobenius automorphism o, has order n. 
b. Deduce from part (a) that G(E/Z,) is cyclic of order n with generator o,. [Hint: Remember that 
|G(E/F)| ={E: F}=[E: F] 
for a finite separable splitting field extension E over F’.] 
Exercises 15 through 22 introduce formal derivatives in F [x]. 
Let F be any field and let f(x) = a9 +. ax +--+ + a;x' 4+---+a,x” be in F[x]. The derivative f’(x) of 
f(x) is the polynomial 
f@=a te +@- Dax?! +--+: Dax", 
where i - | has its usual meaning fori € Z* and 1 € F. These are formal derivatives; no “limits” are involved 
here. 
a. Prove that the map D: F[x] > F[x] given by D(f(x)) = f'(x) is a homomorphism of (F[x], +). 
b. Find the kernel of D in the case that F is of characteristic 0. 
c. Find the kernel of D in the case that F is of characteristic p 4 0. 
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16. 


17. 


18. 


19. 


20. 


21. 


22. 
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Continuing the ideas of Exercise 15, shows that: 


a. D(af(x)) = aD(f(x)) for all f(x) ¢ F[x]andae F. 

b. DCf(x)g(x)) = f(x)g’(x) + f’(x)g(x) for all f(x), g(x) € Fla]. [Hint: Use part (a) of this exercise and 
the preceding exercise and proceed by induction on the degree of f(x)g(x).] 

ce Df") = (m - Df y""! f'(x) for all f(x) € Fx]. [Hint: Use part (b).] 

Let f(x) € F[x], and leta € F be a zero of f(x) of multiplicity v. Show that v > 1 if and only if & is also a 

zero of f’(x). [Hint: Apply parts (b) and (c) of Exercise 16 to the factorization f(x) = (x — aw)” g(x) of f@) 

in the ring F[x].] 

Show from Exercise 17 that every irreducible polynomial over a field F of characteristic 0 is separable. [Hint: 

Use the fact that irr(a, F) is the minimal polynomial for a over F.} 

Show from Exercise 17 that an irreducible polynomial q(x) over a field F of characteristic p 4 0 is not separable 

if and only if each exponent of each term of g(x) is divisible by p. 

Generalize Exercise 17, showing that f(x) € F[x] has no zero of multiplicity >1 if and only if f(x) and f (x) 

have no common factor in F[x] of degree >0. 

Working a bit harder than in Exercise 20, show that f(x) € F[x] has no zero of multiplicity >1 if and only if 

f(x) and f/(x) have no common nonconstant factor in F[x]. [Hint: Use Theorem 46.9 to show that if 1 is a 

gcd of f(x) and f'(x) in F[x], it is a ged of these polynomials in F[x] also.] 

Describe a feasible computational procedure for determining whether f(x) € F[x] hasazero of multiplicity >1, 

without actually finding the zeros of f(x). [Hint: Use Exercise 21.] 


tToraLLy INSEPARABLE EXTENSIONS 


This section shows that a finite extension E of a field F can be split into two stages: a 
separable extension K of F, followed by a further extension of K to E that is as far from 
being separable as one can imagine. 

We develop our theory of totally inseparable extensions in a fashion parallel to our 
development of separable extensions. 


52.1 Definition _A finite extension E ofa field F is a totally inseparable extension of F if {EZ : F} =1< 


[E : F]. Anelement q of F is totally inseparable over F if F(«) is totally inseparable 
over F. a 


We know that {F(a) : F} is the number of distinct zeros of inr(a, F). Thus @ is 
totally inseparable over F if and only if irr(a, F) has only one zero that is of mullti- 
plicity >1. 


52.2 Example Referring to Example 51.4, we see that Z,(y) is totally inseparable over Zp(y?), where y 
is an indeterminate. A 


52.3 Theorem (Counterpart of Theorem 51.9) If K isa finite extension of F, E is a finite extension 
of F, and F < E < K, then K is totally inseparable over F if and only if K is totally 
inseparable over E and E is totally inseparable over F. 


| This section is not used in the remainder of the text. 


Proof 


52.4 Corollary 


Proof 


52.5 Theorem 
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Since F < E < K,wehave{K : E] > Land[£: F] > 1. Suppose K is totally insepa- 
rable over F. Then {K : F} = 1, and 


{K : FJ} ={K: E}\{E: F}, 
so We must have 
{K :E}=1<[K:E] and {E: F}=1 <[E: F}. 


Thus K is totally inseparable over E, and E is totally inseparable over F. 
Conversely, if K is totally inseparable over E and E is totally inseparable over F, 
then 


{K: FPF} ={K: EYE: F}=A\)D=1, 
and [K : F] > 1. Thus K is totally inseparable over F. ° 


Theorem 52.3 can be extended by induction, to any finite proper tower of finite 
extensions. The top field is a totally inseparable extension of the bottom one if and only 
if each field is a totally inseparable extension of the one immediately under it. 


(Counterpart of the Corollary of Theorem 51.10) If £ is a finite extension of F,, then 
E is totally inseparable over F if and only if each a in E,a + F, is totally inseparable 
over F. 


Suppose that E is totally inseparable over F, and let a € E, witha ¢ F. Then 
F<F(@)<E. 


If F(a) = E, we are done, by the definition of @ totally inseparable over F. If F < 
F(a) < E, then Theorem 52.3 shows that since E is totally inseparable over F, F(a) is 
totally inseparable over F. 

Conversely, suppose that for every a € E, with a ¢ F,a is totally inseparable 
over F’. Since E is finite over F, there exist a, ---, a, such that 


F < F(a) < Flaj,a2) <---< E= F(qy,---, ap). 


Now since q; is totally inseparable over F, a; 1s totally inseparable over F(a, ---, a;-1), 
because g(x) = irr(a;, F(a@i,---, @;-1)) divides irr(@;, F) so that a; is the only zero 
of g(x) and is of multiplicity >1. Thus F(a,,---,a;) is totally inseparable over 
F(a,,---,aj—-1), and E is totally inseparable over F, by Theorem 52.3, extended by 
induction. 5 


Thus far we have so closely paralleled our work in Section 51 that we could have 
handled these ideas together. 


Separable Closures 
We now come to our main reason for including this material. 


Let F have characteristic p # 0, and let E bea finite extension of F. Thena € E,a ¢ F, 
is totally inseparable over F if and only if there is some integer t > 1 such thata? € F. 
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Proof 
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Furthermore, there is a unique extension K of F, with F < K < E, such that K is 
separable over F, and either E = K or E is totally inseparable over K. 


Leta € E,a € F, be totally inseparable over F. Then irr(w, F’) has just one zero a of 
multiplicity >1, and, as shown in the proof of Theorem 51.14, irr(a, F) must be of the 
form 


xP —a?., 


Hence a” € F for somer > 1. 
Conversely, if a?’ € F for some ft > 1, where a € E anda ¢ F, then 


xP — a? = (x-a), 


and (x”’ — a") € F[x], showing that irr(a, F) divides (x — a)”’. Thus irr(a, F) has a 
as its only zero and this zero is of multiplicity >1, so @ is totally inseparable over F. 


For the second part of the theorem, let E = F(q@,---,a,). Then if 
irr(a;, F) = T] (x?" _ a"), 
j 
with a1 = a), let Bj; = a . We have F(611, 621,-+--. Bat) < £, and B;1 is a zero of 


filx) =| [@ - By), 
j 


where fj(x) € F[x]. Now since raising to the power p is an isomorphism o, of E onto 
a subfield of E, raising to the power of p’ is the isomorphic mapping (o,)' of E onto a 
subfield of E. Thus since the «;; are all distinct for a fixed i, so are the B,,; for a fixed i. 
Thérefore, 6j; is separable over F, because it is a zero of a polynomial f;(x) in F[x] 
with zeros of multiplicity 1. Then 


K = F(Bu, Bar, +++, Bnt) 


is separable over F, by the proof of Corollary 51.10. If all p* = 1, then K = E. If 
some p“ £1, then K + E, and a? = Bj; isin K, showing that each a; ¢ K is totally 
inseparable over K, by the first part of this theorem. Hence FE = K(a,---, a,) is totally 
inseparable over K, by the proof of Corollary 52.4. 

It follows from Corollaries 51.10 and 52.4 that the field K consists of all elements a 
in £ that are separable over F. Thus K is unique. e 


The unique field K of Theorem 52.5 is the separable closure of F in E. | 

The preceding theorem shows the precise structure of totally inseparable extensions 
of a field of characteristic p. Such an extension can be obtained by repeatedly adjoining 
pth roots of elements that are not already pth powers. 

We remark that Theorem 52.5 is true for infinite algebraic extensions E of F’. The 
proof of the first assertion of the theorem is valid for the case of infinite extensions also. 
For the second part, since w + 8, a8, anda/B, for 8 # 0, are all contained in the field 
F(a, B), all elements of E separable over F form a subfield K of E, the separable 
closure of F in E. It follows that ana ¢ E,a ¢ K, is totally inseparable over K, since 
a and all coefficients of irr(a, K) are in a finite extension of Ff, and then Theorem 52.5 
can be applied. 
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l@ EXERCISES 52 


Concepts 


1. 
2. 


BY 


Let y and z be indeterminates, andlet u = y!* and v = z!®. Describe the separable closure of Za(u, v) in Z3(y, z). 
Let y and z be indeterminates, and let u = y! and v = y2z!8 
Za(y, Z). 


Referring to Exercise 1, describe the totally inseparable closure (see Exercise 6) of Z3(u, v) in Za(y, z). 


. Describe the separable closure of Z3(u, v) in 


. Referring to Exercise 2, describe the totally inseparable closure of Z3(u, v) in Z3(y, z). (See Exercise 6.) 
. Mark each of the following true or false. 


a. No proper algebraic extension of an infinite field of characteristic p 4 Ois ever a separable extension. 
_____ b. If F(@) is totally inseparable over F of characteristic p 4 0, thena” € F for somet > 0. 

c. For an indeterminate y, Zs(y) is separable over Zs(y°). 
______ d. For an indeterminate y, Z5(y) is separable over Z5(y!°). 

e. For an indeterminate y, Zs(y) is totally inseparable over Zs(y 
f. If F is a field and a is algebraic over F’, then @ is either separable or totally inseparable over F’. 
g. If E is an algebraic extension of a field F, then F has a separable closure in E. 


___. h. If E is an algebraic extension of a field F, then £ is totally inseparable over the separable closure 
of F in E. 


i. If E is an algebraic extension of a field F and E is not a separable extension of F,, then E is totally 
inseparable over the separable closure of F in E. 


j. If q is totally inseparable over F, then a is the only zero of irr(@, F). 


10) 


Theory 


6. 


7. 


8. 


Show that if Z is an algebraic extension of a field F, then the union of F with the set of all elements of E totally 
inseparable over F forms a subfield of £, the totally inseparable closure of F in E. 


Show that a field F of characteristic p # O is perfect if and only if F? = F, that is, every element of F is a pth 
power of some element of F’. 

Let E be a finite extension of a field F of characteristic p. In the notation of Exercise 7, show that E? = E if 
and only if F? = F. [Hint: The map o, : E — E defined by o,(a) = a? for a € E is an isomorphism onto a 
subfield of Z. Consider the diagram in Fig. 52.7, and make degree arguments.] 


52.7 Figure 
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53.1 Definition 


53.2 Theorem 


Automorphisms and Galois Theory 


GaLotis THEORY 


Résumé 


This section is perhaps the climax in elegance of the subject matter of the entire text. 
The Galois theory gives a beautiful interplay of group and field theory. Starting with 
Section 48, our work has been aimed at this goal. We shall start by recalling the main 
results we have developed and should have well in mind. 


1. LetF <E < Fo € E, and let £ be a conjugate of a over F, that is, irr(w, F) 
has 6 as a zero also. Then there is an isomorphism yg mapping F(a) onto 
F(B) that leaves F fixed and maps a onto f. 

2. If F< E < Fanda € E, then an automorphism o of F that leaves F fixed 
must map a onto some conjugate of a over F’. 

3. If F < E, the collection of all automorphisms of E leaving F fixed forms a 
group G(E/F). For any subset S of G(E/F), the set of all elements of E left 
fixed by all elements of S is a field Es. Also, F < Eqce/r. 

4. A field E, F < E < F,isa splitting field over F if and only if every 
isomorphism of E onto a subfield of F leaving F fixed is an automorphism 
of E. If E is a finite extension and a splitting field over F, then 
|G(E/F)| ={E : F}. 

5. If E isa finite extension of F, then {E : F} divides [E : F]. If E is also 
separable over F, then {E : F} = [E: F]. Also, E is separable over F if and 
only if irr(a, F) has all zeros of multiplicity 1 for every a ¢ E. 

6. If £ isa finite extension of F and is a separable splitting field over F, then 
IG(E/P)|={E: F} = [E: FI. 


Normal Extensions 


We are going to be interested in finite extensions K of F such that every isomorphism 
of K onto a subfield of F leaving F fixed is an automorphism of K and such that 


[K: F]={K:F}. 


In view of results 4 and 5, these are the finite extensions of F that are separable splitting 
fields over F’. 


A finite extension K of F is a finite normal extension of F if K is a separable splitting 
field over F. a 


Suppose that K is a finite normal extension of F’, where K < F, as usual. Then by 
result 4, every automorphism of F leaving F fixed induces an automorphism of K. As 
before, we let G(K / F) be the group of all automorphisms of K leaving F fixed. After 
one more result, we shall be ready to illustrate the main theorem. 


Let K be a finite normal extension of F, and let E be an extension of F, where F < E < 
K < F.Then K isa finite normal extension of EZ, and G(K /E) is precisely the subgroup 


Proof 


53.3 Example 
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of G(K/F) consisting of all those automorphisms that leave E fixed. Moreover, two 
automorphisms o and t in G(K/F) induce the same isomorphism of E onto a subfield 
of F if and only if they are in the same left coset of G(K /E) in GK /F). 


If K is the splitting field of a set {f;(x) |i € J} of polynomials in F[x], then K is 
the splitting field over E of this same set of polynomials viewed as elements of E[x]. 
Theorem 51.9 shows that K is separable over E, since K is separable over F. Thus K 
is a normal extension of FE. This establishes our first contention. 

Now every element of G(K /E) is an automorphism of K leaving F fixed, since it 
even leaves the possibly larger field E fixed. Thus G(K/E) can be viewed as a subset 
of G(K/F). Since G(K/E) is a group under function composition also, we see that 
G(K/E) < G(K/F). 

Finally, for o and t in G(K/F),o and t are in the same left coset of G(K /E) if 
and only if t-!o € G(K/E) or if and only ifo = ty for uy € G(K/E). Butifo = tu 
for u € G(K/E), then fora € E, we have 


o(a@) = (TUM) = T(U(@)) = T(@), 
since ju(@) = a fora € E. Conversely, if ¢(@) = t(@) for alla € EZ, then 
(toa) = o 
for alla € E,sot~!o leaves E fixed, and p = t~'c is thus in G(K/E). @ 


The preceding theorem shows that there is a one-to-one correspondence between 
left cosets of G(K /E) in G(K /F) and isomorphisms of £ onto a subfield of K leaving 
F fixed. Note that we cannot say that these left cosets correspond to automorphisms of E 
over F, since E may not be a splitting field over F. Of course, if E is anormal extension 
of F, then these isomorphisms would be automorphisms of E over F. We might guess 
that this will happen if and only if G(K/E) is anormal subgroup of G(K/F), and this 
is indeed the case. That is, the two different uses of the word normal are really closely 
related. Thus if F is anormal extension of F, then the left cosets of G(K /E) in G(K /F) 
can be viewed as elements of the factor group G(K /F)/G(K /E), which is then a group 
of automorphisms acting on E and leaving F fixed. We shall show that this factor group 
is isomorphic to G(E/F). 


The Main Thecrem 


The Main Theorem of Galois Theory states that for a finite normal extension K of a 
field F, there is a one-to-one correspondence between the subgroups of G(K/F) and 
the intermediate fields F, where F < E < K. This correspondence associates with each 
intermediate field E the subgroup G(K/E). We can also go the other way and start with 
a subgroup H of G(K /F) and associate with H its fixed field Ky. We shall illustrate 
this with an example, then state the theorem and discuss its proof. 


Let K = Q(V2, V3). Now K is a normal extension of Q, and Example 48.17 showed 
that there are four automorphisms of K leaving Q fixed. We recall them by giving their 
values on the basis {1, J2, 43, V6} for K over Q. 
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{4, 0, %, 03} 
{t, o} {t, a9} {t, o3} 
{e} 
(a) 
QV2, V3) = Ky 
Kya) = WV) QW/2) = Kio, QWV6) = Kio) 
Q= Ky, 01, Oy, O3} 


(b) 


53.4 Figure (a) Group diagram. (b) Field diagram. 


t: The identity map 
o, : Maps J/2 onto —+/2, V6 onto —6, and leaves the others fixed 
2: Maps J3 onto —/3, /6 onto —/6, and leaves the others fixed 
03: Maps 4/2 onto —/2, /3 onto —/3, and leaves the others fixed 


We saw that {1, o;, 02, 03} is isomorphic to the Klein 4-group. The complete list of 
subgroups, with each subgroup paired off with the corresponding intermediate field that 
it leaves fixed, is as follows: 


{t, 01, 02, 03} + Q, 
{1,01} + QV3), 
{t, 02} + Qiv2), 
{t, 03} <> Q(V6), 
{u} <> Q(V2, V3). 


All subgroups of the abelian group f{, 01, 02,03} are normal subgroups, and all the 
intermediate fields are normal extensions of Q. Isn’t that elegant? 

Note that if one subgroup is contained in another, then the larger of the two subgroups 
corresponds to the smaller of the two corresponding fixed fields. The larger the subgroup, 
that is, the more automorphisms, the smaller the fixed field, that is, the fewer elements left 
fixed. In Fig. 53.4 we give the corresponding diagrams for the subgroups and intermediate 
fields. Note again that the groups near the top correspond to the fields near the bottom. 
That is, one diagram looks like the other inverted or turned upside down. Since here each 
diagram actually looks like itself turned upside down, this is not a good example for us 
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to use to illustrate this inversion principle. Turn ahead to Fig. 54.6 to see diagrams that 
do not look like their own inversions. A 


If K is a finite normal extension of a field F, then G(K /F) is the Galois group of K 
over F. | 


We shall now state the main theorem, then give another example, and finally, com- 
plete the proof of the main theorem. 


(Main Theorem of Galois Theory) Let K be a finite normal extension of a field F, 
with Galois group G(K /F). Fora field E, where F < E < K, let A(E) be the subgroup 
of G(K /F) leaving E fixed. Then 4 is a one-to-one map of the set of all such intermediate 
fields E onto the set of all subgroups of G(K /F). The following properties hold for A: 


J. ACE) = G(K/E). 

2 E= Ko(K/e) = Kye). 

3. For H < G(K/F), (Eg) =H. 

4, [K:E]=|A(E)| and [£: F] =(G(K/F): A(E)), the number of left cosets 
of ACE) in G(K/F). 

5. Eis anormal extension of F if and only if 4(£) is a normal subgroup of 
G(K /F). When (E) is a normal subgroup of G(K/F), then 

G(E/F) =~ G(K/F)/G(K/E). 


6. The diagram of subgroups of G(K /F) is the inverted diagram of intermediate 
fields of K over F. 


Observations on the Proof We have really already proved a substantial part of this 
theorem. Let us see just how much we have left to prove. 
Property 1 is just the definition of A found in the statement of the theorem. For 
Property 2, Theorem 48.15 shows that 
E < Kqx/s)- 
Let a € K, where w ¢ E. Since K is a normal extension of EF, by using a conjugation 
isomorphism and the Isomorphism Extension Theorem, we can find an automorphism 
of K leaving E fixed and mapping a onto a different zero of irr(a, F'). This implies that 
Keceyey SE, 
so E = Kgx/g). This disposes of Property 2 and also tells us that 4 is one to one, for if 
A(E1) = A(E2), then by Property 2, we have 
Ey = Kia = Kite = £2. 
Now Property 3 is going to be our main job. This amounts exactly to showing that 
i. is an onto map. Of course, for H < G(K/F), we have H < A(Ky), for A surely is 
included in the set of all automorphisms leaving K y fixed. Here we will be using strongly 
our property [K : £] ={K: E£}. 
Property 4 follows from [K : E] = {K : E},[E: F] ={E: F}, and the last state- 
ment in Theorem 53,2. 
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We shall have to show that the two senses of the word normal correspond for Prop- 
erty 5. 

We have already disposed of Property 6 in Example 53.3. Thus only Properties 3 
and 5 remain to be proved. 


The Main Theorem of Galois Theory is a strong tool in the study of zeros of poly- 
nomials. If f(x) ¢ F [x] is such that every irreducible factor of f(x) is separable over F, 
then the splitting field K of f(x) over F is a normal extension of F’. The Galois group 
G(K/F) is the group of the polynomial f(x) over F. The structure of this group 
may give considerable information regarding the zeros of f(x). This will be strikingly 
illustrated in Section 56 when we achieve our final goal. 


Galois Groups over Finite Fields 


Let K bea finite extension of a finite field F . We have seen that K is a separable extension 
of F (a finite field is perfect). Suppose that the order of F is p” and |[K : F] =n, so the 
order of K is p’. Then we have seen that K is the splitting field of x?" — x over F. 
Hence K is a normal extension of F. 

Now one automorphism of K that leaves F fixed is op, where fora € K, op (a) = 
a?” . Note that (opr)'(@) = @?”. Since a polynomial of degree p” can have at most p”™ 
zeros in a field, we see that the smallest power of o,- that could possibly leave all p’” 
elements of K fixed is the nth power. That is, the order of the element op in G(K/F) is 
at least n. Therefore, since |G(K/F)| = [K : F] =n, it must be that G(K/F) is cyclic 
and generated by o,-. We summarize these arguments in a theorem. 


Let K be a finite extension of degree v of a finite field F of p’ elements. Then G(K/F) 
is cyclic of order n, and is generated by o,, where for a € K, op (a) = a? 


We use this theorem to give another illustration of the Main Theorem of Galois 
Theory. 


Let F = Zy, andlet K = GF(p'”), so[K : F] = 12. Then G(K /F) is isomorphic to the 
cyclic group (Z,,, +). The diagrams for the subgroups and for the intermediate fields 
are given in Fig. 53.9. Again, each diagram is not only the inversion of the other, but 
unfortunately, also looks like the inversion of itself. Examples where the diagrams do 
not look like their own inversion are given in next Section 54. We describe the cyclic 


(,) = G(K/F) K=GF(p!2) = Kjy 
2 a a \ 
a - N yer we Ko) = GFP") GRP) = Bigs 
ae 
ae wa ee Kg; = GF(’) GF(p*) = Kio, 
te} F = Zp=GF(p) = Kyo, 


(a) (b) 


53.9 Figure (a) Group diagram. (b) Field diagram. 


Proof 
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subgroups of G(K /F) = (o,) by giving generators, for example, 


io, = lisOp Gp |: A 


Proof of the Main Theorem Completed 


We saw that Properties 3 and 5 are all that remain to be proved in the Main Theorem of 
Galois Theory. 


Turning to Property 3, we must show that for H < G(K/F), \(Ky) = H. We know that 
H <A(Ky) < G(K/F). Thus what we really must show is that it is impossible to have 
H a proper subgroup of 4(K 7). We shall suppose that 


A e<i(Ky) 


and shall derive a contradiction. As a finite separable extension, K = Ky(a) for some 
a € K, by Theorem 51.15. Let 


n=[K: Ky] ={K : Ky} = |G(K/Ku)}. 


Then H < G(K/Ky) implies that |H| < |G(K/Ky)| =n. Thus we would have to 
have |H| < [K : Ky] =n. Let the elements of H be oj,---, 0), and consider the 
polynomial 

|A| 

ff) =[]@-a@). 

i=t 
Then f(x) is of degree |H| <n. Now the coefficients of each power of x in f(x) are 
symmetric expressions in the o;(a). For example, the coefficient of x'¥!~! is —o\(a) — 
ox(@) — +++ — Oym\(). Thus these coefficients are invariant under each isomorphism 
o; € H, since ifo € H, then 


O01,°"',00|n| 


is again the sequence a}, ---, oj#), except for order, H being a group. Hence f(x) has 
coefficients in Kj, and since some a; is 4, we see that some o;(@) is a, so f(a) = 
Therefore, we would have 


deg(a, Ky) < |H| <n =[K: Ky] =[Kn@): Ku]. 


This is impossible. Thus we have proved Property 3. 

We turn to Property 5. Every extension E of F, F < E < K, is separable over F, 
by Theorem 51.9. Thus E is normal over F if and only if E is a splitting field over F. 
By the Isomorphism Extension Theorem, every isomorphism of E onto a subfield of 
F leaving F fixed can be extended to an automorphism of K, since K is normal over 
F. Thus the automorphisms of G(K /F) induce all possible isomorphisms of E onto a 
subfield of F leaving F fixed. By Theorem 50.3, this shows that E is a splitting field 
over F, and hence is normal over F, if and only if for allo €¢ G(K/F) anda € E, 


o(aye FE. 


By Property 2, E is the fixed field of G(K/E), so o(a) € E if and only if for all 
t €G(K/E) 


t(o(a@)) = o(@). 
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This in turn holds if and only if 
(oc 'to\a) =a 


foralla ¢ E,o € G(K/F),andt € G(K/E). But this means that for allo ¢ G(K/F) 
and t € G(K/E), a~'to leaves every element of E fixed, that is, 


(o-!to0) € G(K/E). 


This is precisely the condition that G(K /F) be a normal subgroup of G(K /F). 

It remains for us to show that when F is a normal extension of F, G(E/F) = 
G(K/F)/G(K/E). For 0 € G(K/F), let og be the automorphism of E induced by o 
(we are assuming that E is a normal extension of F’). Thus og € G(E/F). The map 
@: G(K/F) > G(K/F) given by 


oo) = oR 


for o € G(K/F) is a homomorphism. By the Isomorphism Extension Theorem, every 
automorphism of E leaving F fixed can be extended to some automorphism of K;; that 
is, itis Tz for some tT € G(K/F). Thus ¢ is onto G(E/F). The kernel of ¢ is G(K/E). 
Therefore, by the Fundamental Isomorphism Theorem, G(E/F) ~ G(K/F)/G(K/E). 
Furthermore, this isomorphism is a natural one. 5 


EXERCISES 53 


Computations 


The field K = Q(/2, V3, V5) is a finite normal extension of Q. It can be shown that [K : Q] = 8. In Exercises 1 
through 8, compute the indicated numerical quantity. The notation is that of Theorem 53.6. 


1. {Kk : Q} 2. |G(K/Q)| 

3. [A(Q)| 4. |A(Q(V2, V3))| 
5. |A(Q(V6))| 6. |A(Q(V30))| 

7. [MQ(/2 + V6))| 8. |A(K)| 

9. Describe the group of the polynomial (x* — 1) € Q[x] over Q. 


10. Give the order and describe a generator of the group G(GF(729)/GF(9)). 
11. Let K be the splitting field of x? — 2 over Q. (Refer to Example 50.9.) 


a. Describe the six elements of G(K/Q) by giving their values on \/2 and i/3. (By Example 50.9, K = 
QW/2, i73).) 
b. To what group we have seen before is G(K /Q) isomorphic? 


c. Using the notation given in the answer to part (a) in the back of the text, give the diagrams for the subfields 
of K and for the subgroups of G(K /Q), indicating corresponding intermediate fields and subgroups, as we 
did in Fig. 53.4. 


12. Describe the group of the polynomial (x* — 5x? + 6) € Q[x] over Q. 
13. Describe the group of the polynomial (x? — 1) € Q[x] over Q. 
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Concepts 


14, 


15. 


Give an example of two finite normal extensions K, and K> of the same field F such that K; and K> are not 
isomorphic fields but G(K/F) ~ G(K2/F). 


Mark each of the following true or false. 


a. Two different subgroups of a Galois group may have the same fixed field. 

b. In the notation of Theorem 53.6, if F < E <L < K, then aA(E) < A(L). 

ce. If K is a finite normal extension of F, then K is a normal extension of E, where F < E < K. 

d. If two finite normal extensions E and L of a field F have isomorphic Galois groups, then [E : 
F)=[L: F]. 

e. If £ is a finite normal extension of F and A is anormal subgroup of G(E/F), then Ey is anormal 
extension of F. 

—____ f. If E is any finite normal simple extension of a field F, then the Galois group G(E/F) is a simple 

group. 
. No Galois group is simple. 
. The Galois group of a finite extension of a finite field is abelian. 


. Anextension £ of degree 2 over a field F is always a normal extension of F. 


An extension E of degree 2 over a field F is always a normal extension of F if the characteristic 
of F is not 2. 


oe) 


Theory 


16. 


17, 


18. 


19, 


A finite normal extension K of a field F is abelian over F if G(K/F) is an abelian group. Show that if K 
is abelian over F and £ is a normal extension of F, where F < E < K, then K is abelian over E and E is 
abelian over F. 


Let K be a finite normal extension of a field F. Prove that for every a € K, the norm of a over F, given by 
Nxjr@= [[ o@, 
ceG(K/F) 
and the trace of a over F’, given by 
Trxjr(a@) = a a(a), 
o€G(K/F) 
are elements of F. 


Consider K = Q(./2, V3). Referring to Exercise 17, compute each of the following (see Example 53.3). 


a. Nxjo(V2) b. NxjgQ(V2 + V3) 
ce. Nx/Q(V/6) d. NxjQ) 
e. TrKjQ(V2) f. Trx/o(V2 + V3) 
g. Trejo(V6) h. Trx/Q(2) 


Let K be anormal extension of /’, and let K = F(a). Let 
inr(a, F) = x” 4 ag_yx" | +++ bayx + ap. 
Referring to Exercise 17, show that 


a. Nxjrla) = (-1)" a0, b. Tree) = —an-1. 
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20. 


21. 


22. 


23. 


24, 


25. 


26. 
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Let f(x) € F[x] be a polynomial of degree n such that each irreducible factor is separable over F. Show that 
the order of the group of f(x) over F divides n!. 


Let f(x) € F[x] be a polynomial such that every irreducible factor of f(x) is a separable polynomial over F. 
Show that the group of f(x) over F can be viewed in a natural way as a group of permutations of the zeros of 
F(x) in F. 

Let F be a field and let ¢ be a primitive nth root of unity in F, where the characteristic of F is either 0 or does 
not divide n. 


a. Show that #(C) is a normal extension of F. 

b. Show that G(F(¢)/F) is abelian. [Hint: Every o € G(F(¢)/F) maps ¢ onto some ¢" and is completely 
determined by this value r.] 

A finite normal extension K of a field F is cyclic over F if G(K/F) is a cyclic group. 

a. Show that if K is cyclic over F and E is a normal extension of #, where F < E < K, then E is cyclic over 
F and K is cyclic over E. 

b. Show that if K is cyclic over F, then there exists exactly one field E, F < E < K, of degree d over F for 
each divisor d of [K : F']. 

Let K be a finite normal extension of F. 


a. For a € K, show that 


f@= [] @-ce@) 
oeG(K/F) 
isin F[x]. 
b. Referring to part (a). show that f(x) is a power of irr(a, F), and f(x) = ur(a, F) if and only if K = F(a). 
The join E v L of two extension fields E and L of F in F is the smallest subfield of F containing both E 
and L. Thatis, E V L is the intersection of all subfields of F containing both E and L. Let K bea finite normal 


extension of a field F, and let E and L be extensions of F contained in K, as shown in Fig. 53.10. Describe 
G(K/(E Vv L))in terms of G(K/E) and G(K/L). 


With reference to the situation in Exercise 25, describe G{K /(E M L)} in terms of G(K/E) and G(K/L). 


53.9 Figure 


54.1 Definition 
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ILLUSTRATIONS OF GALOIS THEORY 


Symmetric Functions 


Let F be a field, and let y,,---, y, be indeterminates. There are some natural auto- 
morphisms of F(y1,---. y,) leaving F fixed, namely, those defined by permutations of 
{y1,°-+, Yn}. To be more explicit, let o be a permutation of {1,---,n}, that is,o € S,. 
Then o gives rise to a natural map 0: F(y1,-++, Yn) > F(y1, +--+, Yn) given by 


(Su cake *)) _ f Yeas +++, Yow) 
BOI. Yn) (Vert), * +1 Yotn)) 


for fQ1,+++. In), BO. Yn) © Fly, Yeh, with 201, +++, Yn) #0. It is imme- 
diate that is an automorphism of F(y1,---, y,) leaving F fixed. The elements of 
F(y1,-++, Yn) left fixed by all o, for all o € S,, are those rational functions that are 
symmetric in the indeterminates y;,---. Yn. 


An element of the field F(y;,---, ¥,) is asymmetric function in y;,---, y, over F, if 
it is left fixed by all permutations of y;,---, y,, in the sense just explained. a 


Let S,, be the group of all the automorphisms @ for o € S,. Observe that S,, is 
naturally isomorphic to S,. Let K be the subfield of F(1,---, y,) which is the fixed 
field of S,. Consider the polynomial 


f()] [@ - ya: 
i=l 


this polynomial f(x) € (F()1,---., yn))[x] is a general polynomial of degree n. Let a, 
be the extension of @, in the natural way, to (F(1,--+, ¥,))[x], where ox) = x. Now 
F(x) is left fixed by each map o,; for o € S,; that is, 


] [@ -w =] [@ — yew). 
i=l] i=1 


Thus the coefficients of f(x) are in K; they are elementary symmetric functions in the 
yi,-°++, ¥n- As illustration, note that the constant term of f(x) is 
(—1)"yiy2-++ Yas 


the coefficient of x”! is -(y, + yo +--++ y,), and so on. These are symmetric func- 
tions in yj, °++, Ya. 
The first elementary symmetric function in y), +--+, yy is 


Ssr=yity2t--: + Ya, 


the second is 52. = yiy2 + yiy3 +--+ + Yn-1¥n, and so on, and the nthis sy = ypy2-++ Yp. 
Consider the field E = F(s1,---, 5,). Of course, E < K, where K is the field of all 
symmetric functions in y;,---, y, over F. But F(y;, ---, y,) is a finite normal extension 
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54.2 Theorem 
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Automorphisms and Galois Theory 


of EF, namely, the splitting field of 
an 
fe=[]oe-» 
i=l 


over E. Since the degree of f(x) isn, we have at once 
[FQ +++) Yn) E] <n} 
(see Exercise 13, Section 50). However, since K is the fixed field of S, and 
[Sal = [Sal =n, 
we have also 
Al <{F(y1,+++, nd) KY S [FO Yn): K). 
Therefore, 
ni <[F(y1.+++) Yn)? KI <P Ons yn) El <a, 
Ye) 
K=E. 


The full Galois group of F'(y1, +++, yn) over E is therefore S,. The fact that K = E shows 
that every symmetric function can be expressed as a rational function of the elementary 
symmetric functions s,, +--+, 5,. We summarize these results in a theorem. 


Let s1, ---, 8, be the elementary symmetric functions in the indeterminates y,,---, Yn. 
Then every symmetric function of y,, --+, y, over F is a rational function of the elemen- 
tary symmetric functions. Also, F'(y1, ---, y,) is a finite normal extension of degree n! 
of F(s;,---, s,), and the Galois group of this extension is naturally isomorphic to S,. 


In view of Cayley’s Theorem 8.16, it can be deduced from Theorem 54.2 that any 
finite group can occur as a Galois group (up to isomorphism). (See Exercise 11.) 


Examples 


Let us give our promised example of a finite normal extension having a Galois group 
whose subgroup diagram does not look like its own inversion. 


Consider the splitting field in C of x4 — 2 over Q. Now x* — 2 is irreducible over Q, by 
Eisenstein’s criterion, with p = 2. Let a = </2 be the real positive zero of x — 2. Then 
the four zeros of x* — 2 in C are w, —a, ia, and —ia, where 7 is the usual zero of x? + 1 
in C. The splitting field K of x+ — 2 over Q thus contains (iw)/a = i. Since a is a real 
number, Q(a) < R, so Q(a) # K. However, since Q(a, i) contains all zeros of x4 — 2, 
we see that Q(a, i) = K. Letting E = Q(@), we have the diagram in Fig. 54.4. 

Now {1, a, a, ot} is a basis for E over Q, and {1, i} is a basis for K over E. Thus 


2 23 © 3 wD 0 3. 
{1,a, a, a°, i, ia, ia*,ia?} 


K=Q(a,i) 


E=Q@) 


Q 
54.4 Figure 
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is a basis for K over Q. Since [K : Q] = 8, we must have |G(K /Q)| = 8, so we need to 
find eight automorphisms of K leaving Q fixed. We know that any such automorphism o 
is completely determined by its values on elements of the basis {1, a, a, a3, i, ia, ia”, 
ia}, and these values are in turn determined by o (a) and o(i). But o(e) must always be 
a conjugate of & over Q, that is, one of the four zeros of inr(a, Q) = x4 — 2. Likewise, 
o (i) must be a zero of irr(i, Q) = x? + 1. Thus the four possibilities for o(w), combined 
with the two possibilities for o(i), must give all eight automorphisms. We describe 
these in Table 54.5. For example, o3(@) = —ia and p3(i) = 1, while po is the identity 
automorphism. Now 


(Mi pi y(@) = wi (pi(@)) = wie) = “i Qui (@) = —ia, 
and, similarly, 
(Hip) = i, 
$0 [4101 = 5. A similar computation shows that 
(pila) =ia = and = (py fei )(1) = I. 


Thus p14; = 51,80 0141 # (4191 and G(K /Q) is not abelian. Therefore, G(K /Q) must 
be isomorphic to one of the two nonabelian groups of order 8 described in Exam- 
ple 40.6. Computing from Table 54.5, we see that ; is of order 4, 4; is of order 2, 
{o1, 41} generates G(K/Q), and pip = 41,2 = 61. Thus G(K /Q) is isomorphic to 
the group G; of Example 40.6, the octic group. We chose our notation for the elements 
of G(K /Q) so that its group table would coincide with the table for the octic group 
in Table 8.12. The diagram of subgroups H; of G(K /Q) is that given in Fig. 8.13. We 
repeat it here in Fig. 54.6 and also give the corresponding diagram of intermediate fields 
between Q and K. This finally illustrates nicely that one diagram is the inversion of the 
other. 

The determination of the fixed fields Ky, sometimes requires a bit of ingenuity. 
Let’s illustrate. To find Ky,, we merely have to find an extension of Q of degree 2 left 
fixed by {0, 1, 02. 03}. Since all p; leave i fixed, Q(i) is the field we are after. To find 
Ky,, we have to find an extension of Q of degree 4 left fixed by pp and j4;. Since 4 
leaves @ fixed and @ is a zero of irr(a, Q) = x* — 2, we see that Q(a) is of degree 4 over 
Q and is left fixed by {9, 41}. By Galois theory, it is the only such field. Here we are 
using strongly the one-to-one correspondence given by the Galois theory. If we find one 
field that fits the bill, it is the one we are after. Finding Ky, requires more ingenuity. 
Since H7 = {0, 5,} is a group, for any B € K we see that p(B) + 5,(8) is left fixed by 
fo and 8,. Taking 6 = a, we see that po(a~) + $\(~) = a@ + io is left fixed by H7. We 
can check and see that po and 6, are the only automorphisms leaving a + ia fixed. Thus 
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G(K/Q) 
A, = {Po Pas Hs Ka} Hy = {09 Py, 2, 03} H3= {Po 2, 81, 89} 
a 
Hs= {£o, Hi} Hs = {P, 9} He = {60 pat H, = {p; 8} Hyg = {o, 53} 
{9} 


(a) 


QW/2, ) = K = Kip) 


QWD=Ky OGD =Ky Q/2, ) = Ky. QW2+i2)= Ky Q4/2 — i/2) = Ky, 


Sree 


V2) = Ky, QO = ky, QEV2) = Ky, 
Q= Kex/@ 
(b) 


54.6 Figure (a) Group diagram. (b) Field diagram. 


by the one-to-one correspondence, we must have 
Qla + ia) = QW/2 + iV2) = Kuy,. 


Suppose we wish to find irr(@ + ia, Q). Ify = a + ia, then for every conjugate of y 
over Q, there exists an automorphism of K mapping y into that conjugate. Thus we need 
only compute the various different values o(y) foro € G(K /Q) to find the other zeros 
of irr(y, Q). By Theorem 53.2, elements o of G(K /Q) giving these different values can 
be found by taking a set of representatives of the left cosets of G(K /Q(y)) = {p0, 81} 
in G(K /Q). A set of representatives for these left cosets is 


{P0, 01; 2s 23}. 
The conjugates of y = a + io are thus a + ia, ia — a, —a — ia, and ~ia + a. Hence 
i(y, Q) = [(* — (@ t+ia))(x — Ga — @))] 
[a — (—@ — ia))(x — (-ia + a))] 
= (x* — 2iax — 2a)(x* + 2iax — 207) 
=x4+4+4ot=xt+8. A 


54.7 Example 
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We have seen examples in which the splitting field of a quartic (4th degree) poly- 
nomial over a field F is an extension of F of degree 8 (Example 54.3) and of degree 24 
(Theorem 54.2, with n = 4). The degree of an extension of a field F that is a splitting 
field of a quartic over F must always divide 4! = 24. The splitting field of (x — 2)* over 
Q is Q, an extension of degree 1, and the splitting field of (x? — 2)? over Q is Q(V2), 
an extension of degree 2. Our last example will give an extension of degree 4 for the 
splitting field of a quartic. 


Consider the splitting field of x* + 1 over Q. By Theorem 23.11, we can show that x* + 1 
is irreducible over Q, by arguing that it does not factor in Z[x]. (See Exercise 1.) The 
work on complex number in Section 1 shows that the zeros of x4 +1 are (1 +i)/ /2 
and (—1 +i)//2. A computation shows that if 


_ iti 
J2’ 
then 
-l+i —-l-i 1-i 
3 5 7 
ex : a ‘ and a= —. 
J2 J2 J/2 


Thus the splitting field K of x+ + 1 over Q is Q(a), and [K : Q] = 4. Let us compute 
G(K /Q) and give the group and field diagrams. Since there exist automorphisms of K 
mapping & onto each conjugate of w, and since an automorphism a of Q(@) is completely 
determined by o (a), we see that the four elements of G(K /Q) are defined by Table 54.8. 
Since* 


(ojox (a) = 0;(a*) = (a/)k = a 


and aw® = 1, we see that G(K /Q) is isomorphic to the group {1, 3, 5, 7} under multi- 
plication modulo 8. This is the group Gs of Theorem. 20.6. Since oF = 0}, the identity, 
for all 7, GK /Q) must be isomorphic to the Klein 4-group. The diagrams are given in 
Fig. 54.9. 

To find Kjs,,0,}, it is only necessary to find an element of K not in Q left fixed by 
{o1, 03}, since [Ki,,,o,) : Q] = 2. Clearly o;(a@) + 03(@) is left fixed by both o; and 03, 
since {o), 03} is a group. We have 


o1(a@) + 03(a) =a+ oe =iV2. 
Similarly, 
o,(ot) + oy(0) =a +a? = V2 


54.8 Table 
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GK/Q 
| 
{o,, 03} {o;, 05} {o 1, 07} 
oe 
Oe oi} 
(a) 
oftgt)ax 
ae 
QiVD=Kio¢ W=Keo,,o, QW2) = Kig,,0,) 
Q= Kexigy 


(b) 


54.9 Figure (a) Group diagram. (b) Field diagram. 


is left fixed by {o), 07}. This technique is of no use in finding Ej, ,,}, for 
o(a) + o5(a) =a+o°=0, 


and 0 € Q. But by a similar argument, 0; (a@)os5(q) is left fixed by both oj and os, and 


01(@)o5(a) = aa? = —i. 


Thus Q(—i) = Q(i) is the field we are after. A 


@ EXERCISES 54 


Computations (requiring more than the usual amount of theory) 

1. Show that x* + 1 is irreducible in Q[x], as we asserted in Example 54.7. 

2. Verify that the intermediate fields given in the field diagram in Fig. 54.6 are correct (Some are verified in the 
text. Verify the rest.) 

3. For each field in the field diagram in Fig. 54.6, find a primitive element generating the field over Q (see 
Theorem 51.15 and give its irreducible polynomial over Q. 

4. Let ¢ be a primitive 5th root of unity in C. 
a. Show that Q(¢) is the splitting field of x° — 1 over Q. 
b. Show that every automorphism of K = Q(¢) maps ¢ onto some power ¢” of ¢. 
c. Using part (b), describe the elements of G(K /Q). 


d. Give the group and field diagrams for Q(¢) over Q, computing the intermediate fields as we did in Exam- 
ples 54.3 and 54.7. 


10. 
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. Describe the group of the polynomial (x° — 2) € (Q(¢))[x] over Q(c), where ¢ is a primitive 5th root 


of unity. 


. Repeat Exercise 4 for ¢ a primitive 7th root of unity in C. 


. In the easiest way possible, describe the group of the polynomial 


(x8 — 1) € Qix] 


over Q. 


. Find the splitting field K in C of the polynomial (x* — 4x? — 1) € Q[x]. Compute the group of the polynomial 


over Q and exhibit the correspondence between the subgroups of G(K /Q) and the intermediate fields. In other 
words, do the complete job. 


. Express each of the following symmetric functions in y;, y2, y3 over Q as a rational function of the elementary 


symmetric functions s1, 52, $3. 
a. yi? + yo? + ys? 
poe ee Ba 
yz I 3 ¥3 «Y2 
Let a, @2, a3 be the zeors in C of the polynomial 


(x3 — 4x? + 6x — 2) € Q[x]. 
Find the polynomial having as zeros precisely the following: 


a. a; a2 + 03 


2 2 2 
b. ay, @2°, a3 


Theory 


11. 


12. 


13. 


Show that every finite group is isomorphic to some Galois group G(K /F) for some finite normal extension K 
of some field F. 


Let f(x) € F[x] be a monic polynomial of degree n having all its irreducible factors separable over F’. Let 
K < F be the splitting field of f(x) over F, and suppose that f(x) factors in K [x] into 


n 
[ce — a). 
i=] 
Let 


A(f) = [ ]@: - a); 
i<j 
the product (A(f))* is the discriminant of f(x). 


a. Show that A(/) = Oif and only if f(x) has as a factor the square of some irreducible polynomial in F [x]. 
b. Show that (A(f) ¢€ F. 


c. G(K/F) may be viewed as a subgroup of 5, where S, is the group of all permutations of {a; |i = 1,---, 7}. 
Show that G(K/F), when viewed in this fashion, is a subgroup of A,, the group formed by all even 
permutations of {a; |i = 1,---, n}, if and only if A(f) € F. 


An element of C is an algebraic integer if it is a zero of some monic polynomial in Z[x]. Show that the set of 
ali algebraic integers forms a subring of C. 


464 Part X Automorphisms and Galois Theory 


CYCLOTOMIC EXTENSIONS 


The Galois Group of a Cyclotomic Extension 


This section deals with extension fields of a field F obtained by adjoining to F some 
roots of unity. The case of a finite field F was covered in Section 33, so we shall be 
primarily concerned with the case where F is infinite. 


55.1 Definition 


The splitting field of x” — 1 over F is the nth cyclotomic extension of F. | 


Suppose that F is any field, and consider (x” — 1) € F [x]. By long division, as in the 
proof of Lemma 33.8, we see that if a is a zero of x” — 1 and g(x) = (x” — 1)/(@ — a), 
then g(a) = (n- 1)(t/a) ¥ 0, provided that the characteristic of F does not divide n. 
Therefore, under this condition, the splitting field of x" — 1 is a separable and thus a 


normal extension of F’. 


@ HistoricaL Note 


arl Gauss considered cyclotomic polynomi- 

als in the final chapter of his Disquisitiones 
Arithmeticae of 1801. In that chapter, he gave a 
constructive procedure for actually determining 
the roots of ®,(x) in the case where p is prime. 
Gauss’s method, which became an important ex- 
ample for Galois in the development of the general 
theory, was to solve a series of auxiliary equations, 
each of degree a prime factor of p — 1, with the 
coefficients of each in turn being determined by the 
roots of the previous equation. Gauss, of course, 
knew that the roots of ®,(x) were all powers of 
one of them, say ¢. He determined the auxiliary 
equations by taking certain sets of sums of the 
roots ¢7, which were the desired roots of these 
equations. For example, in the case where p = 19 
(and p—1=18 =3 x3 x 2), Gauss needed to 
find two equations of degree 3 and one of degree 2 


as his auxiliaries. It turned out that the first one 
had the three roots, a, =¢+¢%8+¢74 ¢84 
Si Gee eet Si a ae eee a 
anda; = ¢*#,+¢8 46% +66 4 664 ©! Ip fact, 
these three values are the roots of the cubic equa- 
tion x? + x? — 6x — 7. Gauss then found a second 
cubic equation, with coefficients involving the w’s, 
whose roots were sums of two of the powers of ¢, 
and finally a quadratic equation, whose coefficients 
involved the roots of the previous equation, which 
had ¢ as one of its roots. Gauss then asserted (with- 
out a complete proof) that each auxiliary equation 
can in turn be reduced to an equation of the form 
x™ — A, which clearly can be solved by radicals. 
That is, he showed that the solvability of the Galois 
group in this case, the cyclic group of order p — 1, 
implied that the cyclotomic equation was solvable 
in terms of radicals. (See Section 56.) 


Assume from now on that this is the case, and let K be the splitting field of x" — 1 
over F’, Then x” — 1 has n distinct zeros in K, and by Corollary 23.6, these form a cyclic 
group of order n under the field multiplication. We saw in Corollary 6.16 that a cyclic 
group of order n has y(n) generators, where g is the Euler phi-function introduced prior 
to Theorem 20.8. For our situation here, these y(n) generators are exactly the primitive 
nth roots of unity. 


55.2 Definition 


55.3 Example 
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The polynomial 


en) 


®,(x) = | [@ - a) 
i=l 


where the a; are the primitive nth roots of unity in F, is the nth cyclotomic polynomial 
over F, a 


Since an automorphism of the Galois group G(K /F) must permute the primitive 
nth roots of unity, we see that ®,,(x) is left fixed under every element of G(K/F) 
regarded as extended in the natural way to K [x]. Thus ®,(x) € F[x]. In particular, for 
F =Q, ®, (x) € Qf], and ©, (x) is a divisor of x” — 1. Thus over Q, we must actually 
have ®,,(x) € Z[x], by Theorem 23.11. We have seen that ©, (x) is irreducible over Q, 
in Corollary 23.17. While ®,(%) need not be irreducible in the case of the fields Zp, it 
can be shown that over Q, ®,(x) is irreducible. 

Let us now limit our discussion to characteristic 0, in particular to subfields of the 
complex numbers. Let i be the usual complex zero of x” + 1. Our work with complex 
numbers in Section 1 shows that 


20 Qn \” 
(cos +isin =) = cos2a +isin2z = 1, 
n A 


so cos(27/n) +i sin(2Qz/n) is an nth root of unity. The least integer m such that 
(cos(27 /n) + i sin(22/n))" = 1 isn. Thus cos(27/n) + i sinQa/n) is a primitive nth 
root of unity, a zero of 


$,(x) € Qia]. 
A primitive 8th root of unity in C is 
cos zi + isin zt 
— — L — 
$ 8 8 

TG 20 ine ETE 
= cos — +1 sin — 

4 


4 
ll ok 1+i 


= + E = ; 
ee a2 52 
By the theory of cyclic groups, in particular by Corollary 6.16 all the primitive 8th roots 
of unity in Q are ¢, ¢7, ¢°, and £7, so 
a(x) = —O)A— SV - OY — 87). 


We can compute, directly from this expression, ®g(x) = x+ 4+ 1 (see Exercise 1). Com- 
pare this with Example 54.7. A 


Let us still restrict our work to F = Q, and let us assume, without proof, that ®, (x) 
is irreducible over Q. Let 


Qn |, 2a 
= cos — +i sin —, 
n n 
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55.4 Theorem 


55.5 Example 


55.6 Corollary 


Proof 


Automorphisms and Galois Theory 


so that ¢ is a primitive nth root of unity. Note that ¢ is a generator of the cyclic mul- 
tiplicative group of order n consisting of all nth roots of unity. All the primitive nth 
roots of unity, that is, all the generators of this group, are of the form ¢” forl <m <n 
and m relatively prime to n. The field Q(¢) is the whole splitting field of x” — 1 over Q. 
Let K = Q(¢). If ¢” is another primitive nth root of unity, then since ¢ and ¢” are con- 
jugate over Q, there is an automorphism 1, in G(K/Q) mapping ¢ onto ¢”. Let 1, be 
the similar automorphism in G(K /Q) corresponding to a primitive nth root of unity ¢’. 
Then 


(mt )(G) = Tm(6") = (Tm(S)Y" a (cy = CN, 


This shows that the Galois group G(K /Q) is isomorphic to the group G,, of Theorem 20.6 
consisting of elements of Z, relatively prime to n under multiplication modulo n. This 
group has g(7) elements and is abelian. 

Special cases of this material have appeared several times in the text and exercises. 
For example, a of Example 54.7 is a primitive 8th root of unity, and we made arguments 
in that example identical to those given here. We summarize these results in a theorem. 


The Galois group of the nth cyclotomic extension of Q has y(n) elements and is isomor- 
phic to the group consisting of the positive integers less than n and relatively prime to n 
under multiplication modulo n. 


Example 54.7 illustrates this theorem, for it is easy to see that the splitting field of 
x4 + 1 is the same as the splitting field of x8 — 1 over Q. This follows from the fact that 
s(x) = x* + 1 (see Example 55.3 and Exercise 1). A 


The Galois group of the pth cyclotomic extension of Q for a prime p is cyclic of order 
p-l. 


By Theorem 55.4, the Galois group of the pth cyclotomic extension of Q has g(p) = 
p — 1 elements, and is isomorphic to the group of positive integers less than p and rela- 
tively prime to p under multiplication modulo p. This is exactly the multiplicative group 
(Z,*, -) of nonzero elements of the field Z, under field multiplication. By Corollary 23.6, 
this group is cyclic. . 4 


Constructible Polygons 


We conclude with an application determining which regular n-gons are constructible 
with a compass and a straightedge. We saw in Section 32 that the regular n-gon is 
constructible if and only if cos(27/n) is a constructible real number. Now let 

2x |. 2a 
¢ =cos — +i sin —. 
n n 


Then 


1 2x | , 2a 
— =cos — —isin—, 
i? n n 
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2x |, On 2m |, 2k 220, 4 2 
cos — + isin — }( cos — —isin — } = cos* — +sin®° — =1. 
n n n n n n 
But then 
1 20 
f¢+-—=2cos—. 
c n 


Thus Corollary 32.8 shows that the regular n-gon is constructible only if ¢ + 1/¢ gen- 
erates an extension of Q of degree a power of 2. 

If K is the splitting field of x" — 1 over Q, then [K : Q] = y(n), by Theorem 55.4. 
Ifo <¢ G(K/Q and o(f) = ¢', then 


( Qnr =") ( Qnr =) 
= | cos —— +7 sin —— } + | cos —— —1 Sin — 
n nh n n 


2nr 
= 2cos —. 
n 


Butforl <r <n, wehave2cos(2zr/n) = 2. cos(27/n) only in the case thatr =n — 1. 
Thus the only elements of G(K /Q) carrying ¢ + 1/¢ onto itself are the identity automor- 
phism and the automorphism rt, with t(¢) = c”—! = 1/t. This shows that the subgroup 
of G(K /Q) leaving Q(é + 1/¢) fixed is of order 2, so by Galois theory, 


ls+$) a= 


Hence the regular n-gon is constructible only if (n)/2, and therefore also (n), is a 
power of 2. 
Jt can be shown by elementary arguments in number theory that if 


St 


n= 2" py s+ pr, 
where the p; are the distinct odd primes dividing n, then 
y(n) = 2! ppt pp — D(a = De (a) 
If y(n) is to be a power of 2, then every odd prime dividing n must appear only to the 
first power and must be one more than a power of 2. Thus we must have each 
pe=2"4+1 


for some m. Since —1 is a zero of x? + 1 for g an odd prime, x + 1 divides x? + 1 for 
q an odd prime. Thus, if m = gu, where q is an odd prime, then 2” + 1 = (2")! + 1 is 
divisible by 2" + 1. Therefore, for p; = 2” + 1 to be prime, it must be that m is divisible 
by 2 only, so p; has to have the form 


pi =27 41, 
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55.7 Example 


Automorphisms and Galois Theory 


a Fermat prime. Fermat conjectured that these numbers 2% + 1 were prime for all 
nonnegative integers &. Euler showed that while k = 0, 1, 2, 3, and 4 give the primes 3, 
5, 17, 257, and 65537, for k = 5, the integer 2) +4 1 is divisible by 641. It has been 
shown that for 5 < k < 19, all the numbers ge) + 1 are composite. The case k = 20 is 
still unsolved as far as we know. For at least 60 values of k greater than 20, including 
k = 9448, it has been shown that 2 4 Lis composite. Itis unknown whether the number 
of Fermat primes is finite or infinite. 

We have thus shown that the only regular n-gons that might be constructible are 
those where the odd primes dividing n are Fermat primes whose squares do not divide n. 
In particular, the only regular p-gons that might be constructible for p a prime greater 
than 2 are those where p is a Fermat prime. 


The regular 7-gon is not constructible, since 7 is not a Fermat prime. Similarly, the 
regular 18-gon is not constructible, for while 3 is a Fermat prime, its square divi- 
des 18. A 


It is a fact that we now demonstrate that all these regular n-gons that are candidates 
for being constructible are indeed actually constructible. Let ¢ again be the primitive nth 
root of unity cos(27/n) + i sin(277/n). We saw above that 


2n 1 
2cos— =f+-, 
n g 


oso) -a]-22 


Suppose now that g(7) is a power 2° of 2. Let E be Q(¢ + 1/f). We saw above that 
Q(é + 1/¢) is the subfield of K = Q(¢) left fixed by H, = {t, t}, where ¢ is the identity 
element of G(K /Q) and t(¢) = 1/¢. By Sylow theory, there exist additional subgroups 
H; of order 2/ of G(Q(¢)/Q) for j = 0, 2,3, ---, s such that 


and that 


{a} = Ao < A, a Wale Hy, = GQ(z)/Q. 
By Galois theory, 
1 
Q= Ky, < Ku,, << Kn =Q(r +7), 


and[Ky,, : Ky] = 2.Notethat(¢ + 1/¢) € R,soQ(é +1/¢) < R.IfKy,_, = Ku,(a;), 
then a; is a zero of some (ajx* + bjx +c¢;) © Ky, [x]. By the familiar “quadratic for- 


mula,” we have 
Ku, = Ku, (y b? = 4ajc;). 


Since we saw in Section 33 that construction of square roots of positive constructible 
numbers can be achieved by a straightedge and a compass, we see that every element in 


2. 


7. 
8. 
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Q(¢ + 1/¢), in particular cos(27/n), is constructible. Hence the regular n-gons where 
g(n) is a power of 2 are constructible. 
We summarize our work under this heading in a theorem. 


55.8 Theorem The regular n-gon is constructible with a compass and a straightedge if and only if all 
the odd primes dividing n are Fermat primes whose squares do not divide n. 


55.9 Example The regular 60-gon is constructible, since 60 = (27)(3)(5) and 3 and 5 are both Fermat 
primes. A 


@ EXERCISES 55 


Computations 
1. 


Referring to Example 55.3, complete the indicated computation, showing that Og(x) = x*+ + 1. [Suggestion: 
Compute the product in terms of ¢, and then use the fact that ¢ 8 — 1 and ¢* = —1 to simplify the coefficients.] 


Classify the group of the polynomial (x?° — 1) € Q[x] over Q according to the Fundamental Theorem of 
finitely generated abelian groups. [Hint: Use Theorem 55.4.] 


. Using the formula for y(n) in terms of the factorization of n, as given in Eq. (1), compute the indicated value: 


a. (60) b. g(1000) c. 9(8100) 


4, Give the first 30 values of n > 3 for which the regular n-gon is constructible with a straightedge and a compass. 


5, Find the smallest angle of integral degree, that is, 1°, 2°, 3°, and so on, constructible with a straightedge and a 


compass. [Hint: Constrycting a 1° angle amounts to constructing the regular 360-gon, and so on.] 


. Let K be the splitting field of x'* — 1 over Q. 


a. Find [K : Q]. 
b. Show that for a € G(K/Q), a? is the identity automorphism. Classify G(K /Q) according to the Funda- 
mental Theorem 11.12 of finitely generated abelian groups. 


Find ©3(x) over Z. Find ®g(x) over Z3. 


How many elements are there in the splitting field of x° — 1 over Z3? 


Concepts 
9. 


Mark each of the following true or false. 


a. ©,(x) is irreducible over every field of characteristic 0. 


b. Every zero in C of ®,(x) is a primitive nth root of unity. 

. The group of ®,(x) € Q[x] over Q has order n. 

. The group of ®,,(x) € Q[x] over Q is abelian. 

. The Galois group of the splitting field of ©, (x) over Q has order p(1). 
. The regular 25-gon is constructible with a straightedge and a compass. 


. The regular 17-gon is constructible with a straightedge and a compass. 
. Fora prime p, the regular p-gon is constructible if and only if p is a Fermat prime. 


. All integers of the form 22) 4.1 for nonnegative integers k are Fermat primes. 
. All Fermat primes are numbers of the form 2” + 1 for nonnegative integers k. 


—_—_ mw SO sm» oa 
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Theory 


Automorphisms and Galois Theory 


10. Show that if F is a field of characteristic not dividing n, then 


11. 
12. 
13. 


14 


15. 


x 1a II @4(x) 


d\n 


in F [x], where the product is over all divisors d of n. 


Find the cyclotomic polynomial ®,(x) over Q for n = 1, 2, 3, 4, 5, and 6. [Hint: Use Exercise 10.] 
Find 1(x) in Q[x]. [Hint: Use Exercises 10 and 11.] 
Show that in Q[x], &2,(x) = ®,(—x) for odd integers mn > 1. [Hint: If ¢ is a primitive nth root of unity for n 


odd, what is the order of —¢?] 


'. 


Let n,m € Z* be relatively prime. Show that the splitting field in C of x”” — 1 over Q is the same as the 


splitting field in C of (x” — 1)(x” — 1) over Q. 


Let n,m € Z* be relatively prime. Show that the group of (x””" — 1) € Q{x] over Q is isomorphic to the direct 


product of the groups of (x” — 1) € Q[x] and of (x” — 1) € Q[x] over Q. [Hint: Using Galois theory, show 
that the groups of x” — 1 and x” — 1 can both be regarded as subgroups of the group of x”” — 1. Then use 
Exercises 50 and 51 of Section 11.] 


56.1 Definition 


INSOLVABILITY OF THE QUINTIC 


The Problem 


We are familiar with the fact that a quadratic polynomial f(x) = ax* + bx +c,a 40, 
with real coefficients has (—b + Vb? — 4ac)/2a as zeros in C. Actually, this is true 
for f(x) € F[x], where F is any field of characteristic 4 2 and the zeros are in F. 
Exercise 4 asks us to show this. Thus, for example, (x? + 2x + 3) € Q[x] has its zeros 
in Q(./—2). You may wonder whether the zeros of a cubic polynomial over Q can 
also always be expressed in terms of radicals. The answer is yes, and indeed, even the 
zeros of a polynomial of degree 4 over Q can be expressed in terms of radicals. After 
mathematicians had tried for years to find the “radical formula” for zeros of a 5th degree 
polynomial, it was a triumph when Abel proved that a quintic need not be solvable by 
radicals. Our first job will be to describe precisely what this means. A large amount of 
the algebra we have developed is used in the forthcoming discussion. 


Extensions by Radicals 


An extension K of a field F is an extension of F by radicals if there are elements 
a@,---,a, € K and positive integers m1, ---,, such that K = F(a,---,a,), ay EF 
and a; € F(o,---,a;-1) for 1 <i <r. A polynomial f(x) € F[x] is solvable by 
radicals over F if the splitting field E of f(x) over F is contained in an extension of F 
by radicals. a 


A polynomial f(x) € F(x) is thus solvable by radicals over F if we can obtain 
every zero of f(x) by using a finite sequence of the operations of addition, subtraction, 
multiplication, division, and taking n;th roots, starting with elements of F. Now to say 
that the quintic is not solvable in the classic case, that is, characteristic 0, is not to say 
that no quintic is solvable, as the following example shows. 
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@ HISTORICAL NOTE 


he first publication of a formula for solving cu- 

bic equations in terms of radicals was in 1545 
in the Ars Magna of Girolamo Cardano, although 
the initial discovery of the method is in part also 
due to Scipione del Ferro and Niccolo Tartaglia. 
Cardano’s student, Lodovico Ferrari, discovered a 
method for solving quartic equations by radicals, 
which also appeared in Cardano’s work. 

After many mathematicians had attempted to 
solve quintics by similar methods, it was Joseph- 
Louis Lagrange who in 1770 first attempted a de- 
tailed analysis of the general principles underlying 
the solutions for polynomials of degree 3 and 4, and 
showed why these methods fail for those of higher 
degree. His basic insight was that in the former 
cases there were rational functions of the roots that 
took on two and three values, respectively, under all 


possible permutations of the roots, hence these ra- 
tional functions could be written as roots of equa- 
tions of degree less than that of the original. No 
such functions were evident in equations of higher 
degree. 

The first mathematician to claim to have a proof 
of the insolvability of the quintic equation was Paolo 
Ruffini (1765-1822) in his algebra text of 1799. His 
proof was along the lines suggested by Lagrange, 
in that he in effect determined all of the subgroups 
of S; and showed how these subgroups acted on 
rational functions of the roots of the equation. Un- 
fortunately, there were several gaps in his various 
published versions of the proof. It was Niels Henrik 
Abel who, in 1824 and 1826, published a complete 
proof, closing all of Ruffini’s gaps and finally set- 
tling this centuries-old question. 


56.2 Example The polynomial x° — 1 is solvable by radicals over Q. The splitting field K of wal 
| is generated over Q by a primitive Sth root ¢ of unity. Then ¢° = 1, and K = Q(¢). 
Similarly, x° — 2 is solvable by radicals over Q, for its splitting field over Q is generated 
by 2 and ¢, where ~/2 is the real zero of x? — 2. A 


To say that the quintic is insolvable in the classic case means that there exists 
some polynomial of degree 5 with real coefficients that is not solvable by radicals. 
i We shall show this. We assume throughout this section that all fields mentioned have 
characteristic 0. 

The outline of the argument is as follows, and it is worthwhile to try to remember 
it. 

1. We shall show that a polynomial f(x) € F [x] is solvable by radicals over F 

(if and) only if its splitting field E over F has a solvable Galois group. Recall 
that a solvable group is one having a composition series with abelian 
quotients. While this theorem goes both ways, we shall not prove the “if” part. 


2. We shall show that there is a subfield F of the real numbers and a polynomial 
f(x) € F[x] of degree 5 with a splitting field E over F such that G(E/F)= 8s, 
the symmetric group on 5 letters. Recall that a composition series for S5 is 
{t} < As < Ss. Since As is not abelian, we will be done. 


The following lemma does most of our work for Step 1. 


56.3 Lemma _ Let F be a field of characteristic 0, and let a € F. If K is the splitting field of x” —a 
over F, then G(K /F) is a solvable group. 
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Proof 


56.4 Theorem 


Proof 


Automorphisms and Galois Theory 


Suppose first that F contains all the nth roots of unity. By Corollary 23.6 the nth roots of 
unity form a cyclic subgroup of (F*, -). Let ¢ be a generator of the subgroup. (Actually, 
the generators are exactly the primitive nth roots of unity.) Then the nth roots of unity 
are 


1, G: eS es gk 


If 8 € Fisazero of (x" — a) € F[x], then all zeros of x" — a are 


BGR OBB. 

Since K = F(f), an automorphism o in G(K/F)is determined by the value o (6) of the 
automorphism o on f. Now if o(8) = ¢'B and t(8) = ¢/ 8, where t € G(K/F), then 
(ta )(B) = t(o(B)) = (6'B) = 6'r(B) = 66/8, 

since ¢' € F. Similarly, 
(ot )(B) = ¢7¢'B. 


Thus ot = to, and G(K /F) is abelian and therefore solvable. 

Now suppose that F does not contain a primitive nth root of unity. Let ¢ be a generator 
of the cyclic group of nth roots of unity under multiplication in F. Let 6 again be a zero 
of x” — a. Since 8 and ¢8 are both in the splitting field K of x” — a, ¢ = (€B)/f is in 
K. Let F’ = F(¢), so we have F < F’ < K. Now F’ is a normal extension of F, since 
F’ is the splitting field of x" — 1. Since F’ = F(¢), an automorphism n in G(F’/F) is 
determined by n(¢), and we must have 7(¢) = ¢/ for some /, since all zeros of x” — 1 
are powers of ¢. If uw(¢) = ¢/ for u € G(F'/F), then 


(uno) = MINE) = MES) = MEY = GY = OM, 
and, similarly, 
(nung) =o". 
Thus G(F’/F) is abelian. By the Main Theorem of Galois Theory, 
{i} < G(K/F’) < G(K/F) 


is anormal series and hence a subnormal series of groups. The first part of the proof shows 
that G(K / F’) is abelian, and Galois theory tells us that G(K / F)/ G(K / F'’) is isomorphic 
to G(F’/F), which is abelian. Exercise 6 shows that if a group has a subnormal series of 
subgroups with abelian quotient groups, then any refinement of this series also has abelian 
quotient groups. Thus a composition series of G(K/F) must have abelian quotient 
groups, so G(K /F) is solvable. ¢ 


The following theorem will complete Part 1 of our program. 


Let F be a field of characteristic zero, and let F < E < K < F, where E is a normal 
extension of F and K is an extension of F by radicals. Then G(E/F) is a solvable group. 


We first show that K is contained in a finite normal extension L of F by radicals and that 
the group G(L/F) is solvable. Since K is an extension by radicals, K = F(a ,+-+, a) 
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where ay" € F(a), -+-, @;-1) for 1 <i <r anda}! € F. To form L, we first form the 
splitting field L, of fi(x) = x” — a}! over F. Then L; is a normal extension of F, and 
Lemma 56.3 shows that G(L1/F) is a solvable group. Now «;” € L; and we form the 
polynomial 


Aix)= [] (&?-o@)”). 
w€G(Li/F) 
Since this polynomial is invariant under action by any o in G(L)/F), we see that 
f(x) © F[x]. We let L be the splitting field of f(x) over Ly. Then Lz is a splitting 
field over F also and is a normal extension of F by radicals. We can form L2 from 
L; via repeated steps as in Lemma 56.3, passing to a splitting field of x”? — o(a2) 
at each step. By Lemma 56.3 and Exercise 7, we see that the Galois group over F of 
each new extension thus formed continues to be solvable. We continue this process of 
forming splitting fields over F in this manner: At stage i, we form the splitting field of 
the polynomial 
fAw@= JY] (@-o@)"] 
aeG(L;-1/F) 
over L;_1. We finally obtain a field L = L, that is a normal extension of F by radicals, 
and we see that G(L/F) is a solvable group. We see from construction that K < L. 

To conclude, we need only note that by Theorem 53.6, we have G(E/F) = G(L/F)/ 
G(L/E). Thus G(E/F) is a factor group, and hence a homomorphic image, of G(L/F). 
Since G(L/F) is solvable, Exercise 29 of Section 35 shows that G(E/F) is solvable. 

a 
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It remains for us to show that there is a subfield F of the real numbers and a polynomial 
f(x) € F[x] of degree 5 such that the splitting field E of f(x) over F has a Galois group 
isomorphic to Ss. 

Let y; € R be transcendental over Q, y2 € R be transcendental over Q(y), and 
so on, until we get ys ¢ R transcendental over Q(y1,---, y4). It can be shown by a 
counting argument that such transcendental real numbers exist. Transcendentals found in 
this fashion are independent transcendental elements over Q. Let E = Q(y1,-++, Ys). 
and let 


5 
f@) =[]@-y. 
i=1 


Thus f(x) € E[x]. Now the coefficients of f(x) are, except possibly for sign, among 
the elementary symmetry functions in the y;, namely 


Sp =Y+yot-+ + ys, 
52 = yiy2 + yiy3 + yiy4 + yiys + y2¥3 
+yoy4 + yoys + ¥3¥4 + y3ys + Vays, 


S5 = Yi y2y¥3 Y4y5- 
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B= QO, 5 ¥5) 


F = Q(s;, ...5 55) 


Q 
56.5 Figure 


56.6 Theorem 


Automorphisms and Galois Theory 


The coefficient of x' in f(x) is tss_;. Let F = Q(s,, 52, --- , 55); then f(x) € F[x] (see 
Fig. 56.5). Then E is the splitting field over F of f(x). Since the y; behave as indeter- 
minates over Q, for each o € Ss, the symmetric group on five letters, o induces an auto- 
morphism @ of E defined by o(a) = a for a € Q and G(y;) = you. Since T7_,(x — y;) 
is the same polynomial as T1?_,(« — yoq)), we have 


a(s;) = 8; 
for each i, so @ leaves F fixed, and hence  € G(E/F). Now S; has order 5!, so 
|G(E/F)| > 51. 


Since the splitting field of a polynomial of degree 5 over F has degree at most 5! over 
F, we see that 


IG(E/F)| <5! 


Thus |G(£/F)| = 5!, and the automorphisms 6 make up the full Galois group G(E/F). 
Therefore, G(E/F) = Ss, so G(E/F) is not solvable. This completes our outline, and 
we summarize in a theorem. 


Let y1,-+-, ys be independent transcendental real numbers over Q. The polynomial 
5 
f@)=[[@-y 
irl 


is not solvable by radicals over F = Q(s,,--+, 55), where s; is the ith elementary sym- 
metric function in yj,---, ys. 


it is evident that a generalization of these arguments shows that (final goal) a poly- 
nomial of degree 7 need not be solvable by radicals for n > 5. 

In conclusion, we comment that there exist polynomials of degree 5 in Q[x] that 
are not solvable by radicals over Q. A demonstration of this is left to the exercises (see 
Exercise 8). 


jm EXERCISES 56 


Concepts 


1. Can the splitting field K of x? + x + 1 over Zz be obtained by adjoining a square root to Z» of an element in 
Zz? 1s K an extension of Z by radicals? 


2. Is every polynomial in F[x] of the form ax® + bx® + cx + dx? + e, where a ¥ 0, solvable by radicals over 
F, if F is of characteristic 0? Why or why not? 


3. Mark each of the following true of false. 


a. Let F be a field of characteristic 0. A polynomial in F [x] is solvable by radicals if and only if its 


splitting field in F is contained in an extension of F by radicals. 


b. Let F be a field of characteristic 0. A polynomial in F[x] is solvable by radicals if and only if its 


splitting field in F has a solvable Galois group over F. 
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. The splitting field of x!? — 5 over Q has a solvable Galois group. 

. The numbers x and ./7 are independent transcendental numbers over Q. 

. The Galois group of a finite extension of a finite field is solvable. 

. No quintic polynomial is solvable by radicals over any field. 

. Every 4th degree polynomial over a field of characteristic 0 is solvable by radicals. 

. The zeros of a cubic polynomial over a field F of characteristic 0 can always be attained by means of 
a finite sequence of operations of addition, subtraction, multiplication, division, and taking square 
roots starting with elements in F’. 

i. The zeros of a cubic polynomial over a field F of characteristic 0 can never be attained by means of 

a finite sequence of operations of addition, subtraction, multiplication, division, and taking square 

roots, starting with elements in F. 


S79 =m © a Oo 


j. The theory of subnormal series of groups play an important role in applications of Galois theory. 


Theory 


4 


* 


Let F be a field, and let f(x) = ax? +bx +c be in F[x], where a 4 0. Show that if the characteristic of F 
is not 2, the splitting field of f(x) over F is F(.Vb? — 4ac). [Hint: Complete the square, just as in your high 
school work, to derive the “quadratic formula.” ] 


. Show that if F is a field of characteristic different from 2 and 


f@= ax’ +bx* +c, 
where a 4 0, then f(x) is solvable by radicals over F. 


. Show that for a finite group, every refinement of a subnormal series with abelian quotients also has abelian 


quotients, thus completing the proof of Lemma 56.3. [Hint: Use Theorem 34.7.] 


. Show that for a finite group, a subnormal series with solvable quotient groups can be refined to a composition 


series with abelian quotients, thus completing the proof of Theorem 56.4. [Hint: Use Theorem 34.7.] 


. This exercise exhibits a polynomial of degree 5 in Q[x] that is not solvable by radicals over Q. 


a. Show that if a subgroup H of $5 contains a cycle of length 5 and a transposition t, then H = S5. [Hint: 
Show that H contains every transposition of Ss and apply Corollary 9.12. See Exercise 39, Section 9.] 

b. Show that if f(x) is an irreducible polynomial in Q[x] of degree 5 having exactly two complex and three 
real zeros in C, then the group of f(x) over Q is isomorphic to Ss. [Hint: Use Sylow theory to show that 
the group has an element of order 5. Use the fact that f(x) has exactly two complex zeros to show that the 
group has an element of order 2. Then apply part (a).] 

c. The polynomial f(x) = 2x° — 5x* +5 is irreducible in Q[x], by the Eisenstein criterion, with p = 5. Use 
the techniques of calculus to find relative maxima and minima and to “graph the polynomial function /” well 
enough to see that f(x) must have exactly three real zeros in C. Conclude from part (b) and Theorem 56.4 
that f(x) is not solvable by radicals over Q. 
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Appendix: Matrix Algebra 


We give a brief summary of matrix algebra here. Matrices appear in examples in some 
chapters of the text and also are involved in several exercises. 
A matrix is a rectangular array of numbers. For example, the array 


2 -1 4 
5 1 | @ 


is a matrix having two rows and three columns. A matrix having m rows and n columns 
is an m <n matrix, so Matrix (1) is a 2 x 3 matrix. If m =n, the matrix is square. 
Entries in a matrix may be any type of number—integer, rational, real, or complex. We 
let Mnxn(R) be the set of all m x n matrices with real number entries. If m = n, the 
notation is abbreviated to M,,(R). We can similarly consider M,,(Z), M2x3(C), etc. 

Two matrices having the same number m of rows and the same number n of columns 
can be added in the obvious way: we add entries in corresponding positions. 


In M2,3(Z), we have 


2 -1 4 1 0 -3 3 -1 1 
E 1 +2 =| ele ~6 . as 
We will use uppercase letters to denote matrices. If A, B, and C are m x n matrices, 
it is easily seen that A+ B = B+ A and thatA+(B+C)=(A+8B)+C. 
Matrix multiplication, AB, is defined only if the number of columns of A is equal 


to the number of rows of B. That is, if A is anm x n matrix, then B must be ann x s 
matrix for some integer s. We start by defining as follows the product AB where A is a 
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A3 Example 


A4 Example 
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1 x v matrix and B is ann x 1 matrix: 


AB=[a; a2 +++ Gnl} . | = 41b1 tanb2 + +++ + anbn. (2) 

by 
Note that the result is a number. (We shall not distinguish between a number and the 
1 x | matrix having that number as its sole entry.) You may recognize this product as 


the dot product of vectors. Matrices having only one vow or only one column are row 
vectors or column vectors, respectively. 


We find that 


1 
[3 -—7 2)|/4}) =@3)0)+(-72)4) + @G) = 15. A 
5 


Let A be an m x n matrix and let B be ana x s matrix. Note that the number n of 
entries in each row of A is the same as the number n of entries in each column of B. 
The product C = AB is anm x s matrix. The entry in the ith row and jth column of AB 
is the product of the ith row of A times the jth column of B as defined by Eq. (2) and 
illustrated in Example A2. 


Compute 


Solution Note that A is 2 x 3 and B is 3 x 4. Thus AB will be 2 x 4. The entry in its 
second row and third column is 


2 
(2nd row A)Grdcolumn B)=[1 4 6]!1]=2+4+12=18. 
2 
Computing all eight entries of AB in this fashion, we obtain 
22 9 6 
aB= [i 17 18 AR * 


The product 


i wa alls 4] 


is not defined, since the number of entries in a row of the first matrix is not equal to the 
number of entries in a column of the second matrix. A 


For square matrices of the same size, both addition and multiplication are always 
defined. Exercise 10 asks us to illustrate the following fact. 


A5 Example 
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Matrix multiplication is not commutative. 


That is, AB need not equal BA even when both products are defined, as for A, B € 
M)(Z). It can be shown that A(BC) = (AB)C and A(B + C) = AB + AC whenever 
all these expressions are defined. 

We let I, be the n x n matrix with entries 1 along the diagonal from the upper-left 
corner to the lower-right corner, and entries 0 elsewhere. For example, 


Oro 


1 0 
IZ =|0 0 
0 1 
It is easy to see that if A is any n x s matrix and B is any r x n matrix, then J,A = A 
and BI, = B. That is, the matrix J, acts much as the number | does for multiplication 
when multiplication by J, is defined. 
Let A be an nm Xn matrix and consider a matrix equation of the form AX = B, 
where A and B are known but X is unknown. If we can find ann x n matrix A~! such 
that A~1A = AA7! = J,,, then we can conclude that 


A (AX) = AT'B, (A“1A\X = AT'B, I,X = Aw'B, X=Aq'B, 


and we have found the desired matrix X. Such a matrix A~' acts like the reciprocal of a 
number: A~!A = J, and (1/r)r = 1. This is the reason for the notation AT}, 

if A7! exists, the square matrix A is invertible and A~! is the inverse of A. If 
A! does not exist, then A is said to be singular. It can be shown that if there exists a 
matrix A~! such that A~!A = J,, then AA7! = J, also, and furthermore, there is only 
one matrix A~! having this property. 


Let 


We can check that 
—4 91/2 9) [2 9}|-4 9; {1 0 
1 —2//1 4) J|1 4 1 -—2/7 470 1]° 


ka Bh a 


We leave the problems of determining the existence of A~! and its computation to 
a course in linear algebra. 

Associated with each square n x n matrix A is a number called the determinant 
of A and denoted by det(A). This number can be computed as sums and differences 
of certain products of the numbers that appear in the matrix A. For example, the 


Thus, 
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determinant of the 2 x 2 matrix k A is ad — bc. Note that an n x 1 matrix with 


real number entries can be viewed as giving coordinates of a point in n-dimensional 
Euclidean space R”. Multiplication of such a single column matrix on the left by a real 
n x n matrix A produces another such single column matrix corresponding to another 
point in R". This multiplication on the left by A thus gives a map of R” into itself. It can 
be shown that a piece of R” of volume V is mapped by this multiplication by A into a 
piece of volume |det(A)| - V. This is one of the reasons that determinants are important. 

The following properties of determinants for nm x n matrices A and B are of interest 
in this text: 


det(/,) = 1 

det(AB) = det(A) det(B)) 

det(A) # 0 if and only if A is an invertible matrix 

If B is obtained from A by interchanging two rows (or two columns) of A, 
then det(B) = — det(A) 

If every entry of A is zero above the main diagonal from the upper left corner 


to the lower right corner, then det (A) is the product of the entries on this 
diagonal. The same is true if all entries below the main diagonal are zero. 


PY N 


EXERCISES A , 


In Exercises 1 through 9, compute the given arithmetic matrix expression, if it is defined. 


Dd 
[2 

1+i 
2. 4 

i = 
3. | 4 

3. -2i 

1 -l 
ra 

4 -1 
ff 4 

1 21 
afi 
10. 


11 


. Find E 


I] 


1 


i 0 


4 


1 


_ 


4i 
ee 
4 
af Sh ei 
|e ee 
rae i ee 
is Es ne es 


. Give an example in M2(Z) showing that matrix multiplication is not commutative. 


-1 
, by experimentation if necessary. 
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I 


20 O 
12. Find|0 4 01] ,by experimentation if necessary. 
0 0 —-1 
3 0 0 
13. IfA=}]10 —2 0], find det (A). 
4 17 8 


14. Prove that if A, B € M,C) are invertible, then AB and BA are invertibie also. 
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Notations 


eaes 

ia 

€a¢éS 

{x | P@x)} 
BCA 
BCA 

* AxB 


=,,a = b(modn) 
P(A) 

U 

R. 

Te 

U, 


membership, 1 

empty set, 1 
nonmembership, 1 

set of all x such that P(x), 1 
set inclusion, 2 

subset B # A, 2 

Cartesian product of sets, 3 
integers, 3 

rational numbers, 3 

real numbers, 3 

complex numbers, 3 
positive elements of Z, Q, I 
nonzero elements of Z, Q, R 
relation, 3 

number of elements in A, 4: as order of group, 50 
mapping of A into B by ¢,4 

image of element a under ¢, 4 

image of set A under ¢, 4 

one-to-one correspondence, 4 

the inverse function of @, 5 

cardinality of Zt, 5 

cell containing x € S in a partition of 5, 6 
congruence modulo n, 7 

power set of A, 9 

set of all z ¢ C such that |z| = 1, 15 

set of all x € R such thatO <x <c, 16 

addition modulo c, 16 

group of nth roots of unity, 18 


3 
C, 3 
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Zn 

*,a*b 

0, fog,ot 
(S, >) 

PS eS! 

e 

Maxn(S) 
MAS) 
GL(n, R) 
det(A) 
a7',-—a 
H<G;K<L 
H<G;K <L 
{a) 

nZ 

gcd 

er Si; 

S10 82.9---NS, 
Sa 

L 

a Sh 
n! 

D, 

An 
aH,a+H 
Ha,H+a 
(G: H) 

ge 

Thea: 5, 

Sy) x Sox +++ x S, 
Tha G: 
Bi Gi 

Icom 

G; 

be 

Tj 

~-"[B] 
Ker(@) 
G/N; R/N 
¥ 

Tg 

Z(G) 

Cc 


{0, 1,2,---,2—1}, 18 

cyclic group {0, 1,---,2 — 1} under addition modulo n, 54 

group of residue classes modulo n, 137 

ring {0, 1, ---, — 1} under addition and multiplication 
modulo n, 169 

binary operation, 20 

function composition, 22, 76 

binary structure, 29 

isomorphic structures, 30 

identity element, 32 

m <n matrices with entries from S, 40, 477 

n X n matrices with entries from S, 40, 477 

general linear group of degree n, 40 

determinant of square matrix A, 46, 479 

inverse of a, 49 

subgroup inclusion, 50; substructure inclusion, 173 

subgroup H # G, 50; substructure K #4 L, 173 

cyclic subgroup generated by a, 54 

principal ideal generated by a, 250 

subgroup of Z generated by n, 54 

subring (ideal) of Z generated by n, 169, 250 

greatest common divisor, 62, 258, 395 

intersection of sets, 69 


group of permutations of A, 77 
identity map, 77 

symmetric group on n letters, 78 
n factorial, 78 

nth dihedral group, 79 
alternating group on n letters, 93 
left coset of H containing a, 97 
right coset of H containing a, 97 
index of H in G, 101 

Euler phi-function, 104, 187 
Cartesian product of sets, 104 


direct product of groups, 104, 105 
direct sum of groups, 105 

least common multiple, 107 

natural subgroup of [[j_, G;, 107 
evaluation homomorphism, 126 
projection onto ith component, 127 
inverse image of the set B under ¢, 128 
kernel of homomorphism ¢, 129 

factor group, 137; factor ring, 242 
canonical residue class map, 139, 140 
inner automorphism, 141 

center of the group G, 150 
commutator subgroup, 150 

subset of elements of X left fixed by g, 157 


Ri[x]] 
F(()) 
F[x] 

V(S) 

(by, +++, B;) 
Wf) 

Ip(f) 

irr(a, F) 
deg(a, F’) 
F(@) 

[E : F] 
F(a, see, On) 


H®(X) 


Ay(A/A) 
A(X, Y) 
a\lb 
UFD 


Notations 


isotropy subgroup of elements of G leaving x fixed, 157 
orbit of x under G, 158 

polynomial ring with coefficients in R, 200 

field of quotients of F[xj, 201 

field of rational functions in n indeterminates, 201 
cyclotomic polynomial of degree p — 1, 216, 217 
endomorphisms of A, 221 

group ring, 223 

group algebra over the field F, 223 

quaternions, 224, 225 

formal power series ring in x over R, 231 

formal Laurent series field in x over F, 231 

ring of polynomials in x,,---, x, over F, 255 
algebraic variety of polynomials in S, 255 

ideal generated by elements )), ---,b,, 255 

leading term of the polynomial f, 260 

power product of lt( f), 260 

irreducible polynomial for a over F, 269 

degree of a over F, 269 

field obtained by adjoining a to field F’, 270 

degree of E over F, 283 

field obtained by adjoining a, --- 
algebraic closure of F in E, 286 
an algebraic closure of F, 287, 288 
Galois field of order p”, 300 
product set, 308 

subgroup join, 308 

normalizer of H, 323 

free group on A, 341, 342 

group presentation, 348 
boundary homomorphism, 357 
n-chains of X, 358 

n-cycles of X, 359 

n-boundaries of X, 359 

nth homology group of X, 361 
coboundary homomorphism, 363 
n-cochains of X, 363 

n-cocycles of X, 363 
n-coboundaries of X, 363 

nth cohomology group of X, 363 
n-sphere, 364 

n-cell or n-ball, 364 

Euler characteristic of X, 374 


, a, to F, 285 


homology homomorphism induced from f : X > Y, 375, 381 


chain complex, 381 
relative boundary operator, 382 


kth relative homology group of chain complex A modulo A’, 383 
kth relative homology of simplicial complex X modulo Y, 383 


a divides (is a factor of) b, 389 
unique factorization domain, 390 
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PID 

Vier Si, 

5; US) U+-- US, 
Vv 

N(@) 

Wop 

Ete) En 
G(E/F) 

{E: F} 


principal ideal domain, 391 
union of sets, 391 


Euclidean norm, 401 

norm of a, 408, 410, 455 

conjugation isomorphism of F(a) with F(f), 416 
subfield of F left fixed by all o; or alla € H, 419 
automorphism group of EF over F’, 420 

index of E over F, 428 


Answers to Odd-Numbered 
Exercises Not Asking for 
Definitions or Proofs 


SECTION 0 
1. {- V3, V3} 
3. {1,-1,2, -2, 3, -3,4, -4, 5, —5, 6, —6, 10, -10, 12, —12, 15, —15, 20, —20, 30, —30, 60, —60} 
5. Notaset (not well defined). A case can also be made for the empty set @. 
7. The set @ 7 
9. The set Q i 
11. (@, 1), G, 2), (a,c), (0, D, (8, 2), (Bc), (e, 1), (e, 2), (c, ¢) 
13. Draw the line through P and x, and let y be the point where it intersects the line segment CD. 
17. Conjecture: n(/(A)) = 2°. (Proofs are usually omitted from answers.) 
21. 107, 10°, 10%0 = 1280 = 280 = ||. (The numbers x where 0 < x < 1 can be written to base 12 and to base 2 as well 
as to base 10.) 
23. 1 25. 5 27. 52 
29. Not an equivalence relation 
31. An equivalence relation; 0 = {0},a = {a, —a} for cach nonzeroa € R 
33. An equivalence relation; 
T={i,2,+--, 9}, 
10 = {10, 11, ++, 99}, 
100 = {100, 101, ---, 999}, and in general 
107 = {107,107 + 1,---,10"*? — 1} 
35. i. {1,3,5,---}, {2, 4, 6, ---} 
fi, {1,4, 7, ---}, {2, 5,8, ---}, (3, 6,9, ---} 
iii. {1,6, 11, +++}, {2, 7, 12, ---}, (3, 8, 13, ---}, (4,9, 14, -- +}, (5, 10, 15, «+ +} 
37. The name two-to-two function suggests that such a function f should carry every pair of distinct points into two distinct 


points. Such a function is one to one in the conventional sense. (If the domain has only one element, a function cannot 
fail to be two to two, since the only way it can fail to be two to two is to carry two points into one point, and the set does 
not have two points.) Conversely, every function that is one to one in the conventional sense carries any pair of points 
into two distinct points. Thus the functions conventionally called one to one are precisely those that carry two points 
into two points, which is a much more intuitive unidirectional way of regarding them. Also, the standard way of trying 
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to show a function is one to one is precisely to show that it does not carry two points into just one point. Thus, proving 
a function is one to one becomes more natural in the two-to-two terminology. 


SECTION 1 
1. -i 3. -i 5. 2347i 
7. 17-151 9, -4+4i 
1 1 
u. WB 13. J(- + =i) 
v2 V2 
3 5 
15. V8 (+H) 
J34 34 B 
1 1 1 1 373 3 
17. + —i, + —i 19. 3i,4—— -—= 
J2 V2) V2 V2 2 a 
21. /34i,42i,-V34i 23. 4 25, - 27. J2 
29. 11 31. 5 33. 1,7 


35. 0°36 0,067, 0464, 61,056 6,07 33 
37. With ¢ < 4, we must have ¢? = 2, ¢3 — 0, and ¢4 © 4 again, which is impossible for a one-to-one correspondence. 
39, Multiplying, we obtain 


2122 = |21||Z2|[(cos 6, cos 6, — sin @, sin 62) + (cos 4, sin 8, + sin 4; cos 63)i] 


and the desired result follows at once from Exercise 38 and the equation |Z) ||z2] = |z12Z2|. 


SECTION 2 


1. e,b,a 3. a,c. * 48 not associative. 
5. Top row: d; second row: a; fourth row: c, b. 
7. Not commutative, not associative 
9. Commutative, associative 
11. Not commutative, not associative 
13. 8, 729, nlrb?) 
17. No. Condition 2 is violated. 19. Yes 
21. No. Condition 1 is violated. 
23. a. Yes. b. Yes 
25. Let S = {?, A}. Define * and *’ on S by a%b =? and a ¥’ b= A for all a,b € S. (Other answers are possible.) 
27. True 29, True 
31. False. Let f(x) = x?, g(x) = x, and h(x) = 2x + 1. Then 
(f(x) — g(x) — A(x) = x? — 3x — 1 but 
f(x) — (g@) — h@)) =x? -(-x -D HP tat 


33. True 35, False. Let * be + and let x’ be on Z. 
SECTION 3 
1. i. @ must be one to one. ii. @[S] must be all of S’. 


iii. g(a x b) = h(a) * b(b) for alla,be 8. 
3. No, because ¢ does not map Z onto Z’. @(n) # 1 foralln € Z. 
5. Yes. 7. Yes 9. Yes 
11. No, because @(x?) = (x? + 1). 
13. No, because #(f) = x + 1 has no solution f € F. 
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15. No, because #(f) = 1 has no solution f € F. 
17. a m*n=mn—m—n +2; identity element 2 
b. m*n =imn-+m +n; identity clement 0 
1 
19. a.axb= qe? +a-+b-— 2); identity element 2 
2 2 
bo axb=3ab-—a-—b+ 3° identity element — 
25. No. If (S, *) has a left identity element e, and a right identity element er, then e; = eg. (It is our practice to omit proofs 
from answers.) 
SECTION 4 
1. No. & fails. 3. No. & fails. 5. No. & fails. 
7. The group (Uig90, +) of solutions of z!°°° = 1 in C under multiplication has 1000 elements. 
9. An equation of the form x «x * x «x =e has four solutions in (U, -), one solution in (R, +), and two solutions in 
(R*,-). 
11. Yes 13. Yes 
15. No. The matrix with all entries 0 is upper triangular, but has no inverse. 
17. Yes. 
19, (Proofs are omitted.) ec —1/3 
21, 2, 3. (It gets harder for 4 elements, where the answer is not 4.) 
25. a F oa fe e. F g. T i. F 
SECTION 5 
1. Yes 3. Yes 5. Yes 7. Q and {x” |n € Z} 9. Yes 
11. No. Not closed under multiplication. 
13. Yes 
15. a. Yes b. No. It is not even a subset of F. 
17. a. No. Not closed under addition. b. Yes 
19. a. Yes b. No. The zero constant function is not in F. 
21. a. —50, —25, 0, 25, 50 b. 4, 2, 1, 1/2, 1/4 c. 1,0, 27, 1/x, 1/n? 
23. All matrices E forneZ 
: 4" 0 0 241 
25. All matrices of the form k A or ee 0 forn € Z 
27. 4 29. 3 31. 4 33. 2 35. 3 
39. a. T e T e F g. F i. 7 
SECTION 6 
1. g=4,r=6 3. gq=-7,r=6 5. 8 7. 60 
9. 4 11. 16 13. 2 15. 2 17. 6 19. 4 
21. An infinite cyclic group 
23. Z36 
es 
(2) (3 


) 
Se 5 
yee 


(18) 


ie 
SE eR Oy 
Pe Sen ee 

(0) 
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25, 
33. 


39, 
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1,2, 3,6 27. 1, 2,3, 4,6, 12 29. 1,17 
The Klein 4-group 35. Zo 37. Ze 


xl + i/3) and 3 —iV¥3) 


1 1 1 1 
M1. 53 +1), 503-0, 5(-V3 + 5OV3-| 
51. (p—-lq-) 
SECTION 7 
1. 0,1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 3. 0, 2, 4, 6, 8, 10, 12, 14, 16 
5. .--,-24, -18, —12, —6, 0, 6, 12, 18, 24, --- 
7. a. ab b. a? a a 
9. 


d | f 
d|f 
f fd 


11. Choose a pair of generating directed arcs, call them arc and arc2, start at any vertex of the digraph, and see if the 
sequences arcl, arc2 and arc2, arc! lead to the same vertex. (This corresponds to asking if the two corresponding group 
generators commute.) The group is commutative if and only if these two sequences lead to the same vertex for every 
pair of generating directed arcs. 

13. It is not obvious, since a digraph of a cyclic group might be formed using a generating set of two or more elements, no 
one of which generates the group. 

15. 

17. a. Starting from any vertex a, every path through the graph that terminates at that same vertex a represents a product 

of generators or their inverses that is equal to the identity and thus gives a relation. 
b. a*# =e,b? =e, (aby =e 
SECTION 8 
1 123 45 6 3 12 3 4 5 6 
“A123 65 4 “\3 4 162 5 
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5 G 23 45 :) 
“\2 61 5 4 3 
7 2 9. 1 

11. {1, 2, 3, 4, 5, 6} 13. {1,5} 

15. ©, p, p*, 0°, d, pd, p’d, p'd where their ¢ is our 2;. This gives our elements in the order po, 01, 02, 03, LH, 61, M2, 52. 

17, 24 

19. Referring to Table 8.12, we find that (9) = {0}, (o1) = (3) = {Po P1, P2, P3t, (2) = (eo, Prt (Hi) = (eo, Ma}; 

(142) = {Po, Ha}, (61) = {00, 81}, and (52) = {0, 52}. These are all the cyclic subgroups. A subgroup containing one of 
the “turn the square over” permutations 41, 42, 5), or 62 and also containing ; or p3 will describe all positions of the 
square so it must be the entire group D,. Checking the line of the table opposite j2;, we see that the only other elements 
that can be in a proper subgroup with 2; are p2, f42, and, of course, 09. We check that {, 02, 1, {42} is closed under 
multiplication and is a subgroup. Checking the row of the table opposite jz. gives the same subgroup. Checking the 
rows opposite 5, and opposite 52 gives the subgroup {9, p2, 51, 52} as the only remaining possibility, using the same 
reasoning. 

21. a. These are “elementary permutation matrices,” resulting from permuting the rows of the identity matrix. When another 
matrix A is multiplied on the left by one of these matrices P, the rows of A are permuted in the same fashion that 
the rows of the 3 x 3 identity matrix were permuted to obtain P. Because all 6 possible permutations of the three 
rows are present, we see they will act just like the elements of S; in permuting the entries 1, 2, 3 of the given column 
vector. Thus they form a group because $3 is a group. 

b. The symmetric group $3. 
23. Zp 25. D4 
0 12 3 0 12 3 0 1 2 3 0 1 2 3 
op Moe ee io= (4 i2 :} m=(t 2 3 5) n=(3 3 0 n= (3 0 1 
The table for the left regular representation is the same as the table for Z, with n replaced by A,. For $3, po = 
IS Hees SUEY, TNS ms) pi = (" pie pew ee a etc., where the bottom row in the permutation 
Tro Tr To My M2 M3 Tr, 12 To M2 M3 My, 
Pz consists of the elements of 5; in the order they appear down the column under in Table 8.8. The table for this right 
regular representation is the same as the table for $3 with o replaced by p,. 

31. Nota permutation 33. Not a permutation 

35. a T ce T e. T g. F i. F 

37. A monoid 41. No 43. Yes 

SECTION 9 

1. (1,2, 5}, {3}, {4, 6} 
3. {1,2, 3, 4, 5}, {6}, {7, 8} 
5. (2n|n € Z}, (2n+1|n € Z} 
7 6 23 45 67 *) 
“\4 135 8 62 7 
9 € 2345 67 i) 
"\5 4378 62 1 
11. (1, 3, 4)(2, 6), 8,7) =, HC, 32, 65, DG, 8) 
13. a4 
b. A cycle of length n has order n. 
c. o has order 6; t has order 4. 
d. 6 in Exercises 10 and 11, 8 in Exercise 12. 
e, The order of a permutation expressed as a product of disjoint cycles is the least common multiple of the lengths of 
the cycles. 
15. 6 17. 30 
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19. a 2,3) 
/ \ 
/ \ 
i \ 
f \ 
/ 432 . 
‘ (1, 3.2) 
/ \ 
! \ 
/ (2,3,4) 
i \ 
é \ 
f \ 
/ \ 
/ 4,2 (1, 3)(2, 4) 
/ G42) (29249 


2,43) (1,412.3) 
(1, 2)(3, 4) 


23. a F a F e. F g. T i. T 
SECTION 10 
1 AZ = (++, -8, —4,0, 4,8, -°4 
1+4Z=(---,-7,—3,1,5,9,---} 
244Z ={---,-6,-2,2,6, 10, +++} 


344Z={.--,-5,-1,3,7,11,-°+} 
3. (2) = (0,2, 4,6, 8, 10}, 1 +42) = {1,3,5,7,9, 11} 
(18) = {0, 18}, 1 + (18) = {1, 19}, 2 + (18) = (2, 20), ---,17 + (18) = (17, 35} 


5. 

7. {Po, ta}, {p1, 5}, {P25 Ha}, {03, 55}. Not the same. 

9. {p0, pr}, {pis 03}, {His Ha}, 181, 62} 
11. Yes, we get a coset group isomorphic to the Klein 4-group V. 
13. 3 15. 24 

19. a T ce T e. T g. T i. F 


21. G=Z,, subgroup H = Zp. 
23. Impossible. The number of cells must divide the order of the group, and 12 does not divide 6. 
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SECTION 11 


1. Element Order | Element Order 


(0, 0) 1 (0, 2) 2 
(1, 9) 2 (1, 2) 2 
(0, 1) 4 (0, 3) 4 
(4, 1) 4 (1, 3) 4 


The group is not cyclic 


3, 2 


5. 9 7. 60 


9. {(0, 0), (0, 1}, {@, 0), (1, 0)}, {@, 0), C, D} 
11. {(0, 0), (0, 1), (0, 2), (0, 3)} 
{(, 0), (0, 2), (1, 0), (1, 2)} 
{(, 0), (1, 1), 0, 2), 1, 3)} 

13. Za x Za, Z5 x La, Zi2 X Zs, Zs x Za X Za 


15. 12 
17. 120 
19. 180 


21. Zs, Z, x Zy, Z, x Zp x Ze 
23. Z32, 2 X Dye, Zs X Lg, Ly X Le X Ze, Zn X La X La, 
Zo X Ly X Dy X Ly, Zn X Tn X Ey X Zo X Zo 
25. Zo x Zin, Z3 X Zs X Z13;, Zo x Zyy x Zy, Zs x Zs x Zy x Zy 


29. a 
b 
31. a 
b 
c 
d. 


a[2|3|4)s|e)7 | 3 
number of groups | 2] 3} 5] 7 | 11 | 15 | 22 


. i) 225 ii) 225 iii) 110 

It is abelian when the arrows on both n-gons have the same (clockwise or counterclockwise) direction. 
fe Zo x Zn 

When n is odd. 

. The dihedral group D,. 


33. Z» is an example. 
35. S3 is an example. 


37. The numbers are the same. 41. {-1,]} 
SECTION 12 
1. a. The only isometries of R leaving a number c fixed are the reflection through c that carries c + x to c — x for all 


b. 


x €R, and the identity map. 

The isometries of IR? that leave a point P fixed are the rotations about P through any angle 6 where 0 < @ < 360° 

and the reflections across any axis that passes through P. 

. The only isometries of R that carry a line segment into itself are the reflection through the midpoint of the line 
segment (see the answer to part (a)) and the identity map. 

. The isometries of R? that carry a line segment into itself are a rotation of 180° about the midpoint of the line segment, 
a reflection in the axis containing the line segment, a reflection in the axis perpendicular to the line segment at its 
midpoint, and the identity map. 

. The isometries of R? that carry a line segment into itself include rotations through any angle about an axis that 
contains the line segment, reflections across any plane that contains the line segment, and reflection across the plane 
perpendicular to the line segment at its midpoint. 
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9. 


11. 
17. 
19. 


21. 


25. 
27. 
29. 
31. 
33. 
35. 
37. 
39. 
41. 
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7. 


Translation: order oo 

Rotation: order any n > 2 or 00 
Reflection: order 2 

Glide reflection: order 00 


Rotations 13, Only the identity and reflections. 

Yes. The product of two translations is a translation and the inverse of a translation is a translation. 

Yes. There is only one reflection jz across one particular line L, and 4” is the identity, so we have a group isomorphic 
to Z. 

Only reflections and rotations (and the identity) because translations and glide reflections do not have finite order in the 
group of all plane isometries. 

a. No b. No c. Yes d. No e. D 
a. Yes b. No *¢. No d. No e. Da 
a. No b. No c. No d. Yes e. Z 
a. Yes. 90°, 180° b. Yes c. No 

a. No b. No ce. No 

a. Yes. 180° b. Yes c. No 

a. Yes. 120° b. Yes c. No 

a. Yes. 90°, 180° b. Yes c. No d. (-—1, 1) and (1, 1) 
a. Yes. 120° b. Yes c. No d. (0, 1) and (1, V3) 


SECTION 13 


Yes 3. Yes 5. No 

Yes 9. Yes 

Yes 13. Yes 15. No 

Ker(@) = 7Z; (25) = 2 

Ker(¢) = 6Z; 6(20) = (1, 2, 7)(4, 5, 6) 

Ker(¢) = (0, 4, 8, 12, 16, 20}; (14) = (1, 6)¢4, 7) 

Ker(g) = {(0, 0)}; 6(4, 6) = @, 18) 

2 27, 2 29. Forallg ¢G 

No nontrivial homomorphism. By Theorem 13.12, the image of ¢ would have to be a subgroup of Zs, and hence all of 
Zs for a nontrivial ¢. But the number of cosets of a subgroup of a finite group is a divisor of the order of the group, and 
5 does not divide 12. 
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35. Let d(m,n) = (m, 0) for (m, n) € Zz x Za. 
37. Let d(n) = p, forn € Zs, using our notation in the text for elements of 53. 
39. Let d(m,n) = 2m. 
41, Viewing D, as a group of permutations, let @(o) = (1, 2) for odd o € D, and $(c) be the identity for even ao € Dg. 
43. Let @(c) = (1, 2) for odd o € Sy and ¢(c) be the identity element for even o € Sy. 
51. The image of ¢ is (a), and Ker() must be some subgroup nZ of Z. 
53. hk =kh 55. h” must be the identity e of G. 
SECTION 14 
1, 3 3. 4 5. 2 7 2 
9, 4 11. 3 13. 4 15. 1 
21. a. When working with a factor group G/H, you would let a and b be elements of G, not elements of G/H. The student 
probably does not understand what elements of G/H look like and can write nothing sensible concerning them. 
b. We must show that G/H is abelian. Let aH and bH be two elements of G/H. 
23. a. T a T e T g. T i. T 
29. {po, 1}, {00, Ha}, and {po 43} 
35. Example: Let G = N = 38s, and let H = {po, 1}. Then N is normal in G, but HO N = Z is not normal in G. 
SECTION 15 
1. Z 3. Z4 5. Z4 x Zs 7, 2 9. Z2,xZx ky 
1. 4xZ 13. Z(Da) = C = {0, 2} 
> 15. Z(S; x Da) = {(P0, Po), (Po, P2)}, using the notations for these groups in Section 8, C = A3 x {/o, p2}. 
19. a. T c. F “e. F g. F ae 
21. {f € F*|f)= 1} 
23. Yes. Let f(x) = 1 for x > OQ and f(x) = —1 forx < 0. Then f(x): f(x) = 1 forall x, so f? € K* but f is notin K*. 
Thus f K* has order 2 in F*/K*. 
25. U 
27. The multiplicative group U of complex numbers of absolute value 1 
29. Let G =Z, x Zs. Then H = ((1,0)) is isomorphic to K = ((0, 2)), but G/H is isomorphic to Z4 while G/K is 
isomorphic to Z x Zp. 
31. a. fe} b. The whole group 
SECTION 16 
1. X py = X, Xp, = {Cc}, Xp, = {m1, M2, d, d>, Ch, Xo, = {C}, 
Xy, = {81, 53,1, M2, C, Pi, P3}, Xu, = {52, 54,1, M2, C, Po, Pa}, 
Xs, = (2,4, d,,d,, C}, Xs. = (1,3, 41, d, Ch. 
3. {1, 2,3, 4}, {51, 52, 53, 54}, {71 ma}, {d:, do}, {C}, (Pi, Po, P3, Pa} 
7. A transitive G-set has just one orbit. 
9. a. {81, 82, 53, Sa} and {P), Po, P3, Ps} 
13. b. The set of points on the circle with center at the origin and passing through P 
c. The cyclic subgroup (27) of G=R 
17. a. K = goHg,!. 
b. Conjecture: H and K should be conjugate subgroups of G. 
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19. There are four of them: X, Y, Z, and Zg. 


SECTION 17 


1, 5 3. 2 
7. a. 45 b. 231 
9. a. 90 b 


5. 11,712 


i 
i 
i 


SECTION 18 


1. 0 3.1 5. (1,6) 
7. Commutative ring, no unity, not a field 
9, Commutative ring with unity, not a field 
11, Commutative ring with unity’ not a field 
13. No. {ri|r € R} is not closed under multiplication. 
15. (, 1),0, -1), (-1, 1), (-1,-D 
17, Allnonzerog €Q 19. 1,3 
21. Let R = Z with unity 1 and R’ = Z x Z with unity 1’ = (1, 1). Let 6: R > R’ be defined by ¢(n) = (n, 0). Then 
Pd) = (1.0) 41. 
23. $,:Z— Z where O(n) = 0, ¢2.:Z > Z where O(n) =n 
25. ¢,:ZxZ— Zwhere o)(n,m) =0,¢.:Zx Z—- Zwhere ¢o(n,m) =n 
¢3;:Z2xZ—> Z where 63(n,m) =m 
27. The reasoning is not correct since a product (X — J;)(X + I;) of two matrices may be the zero matrix 0 without having 
either matrix be 0. Counterexample: 
00 1)" 
0 1 0] =h. 
1 0 0 
31. a=2,b=3inZ 
33. a. T a F e T g. T i. T 


SECTION 19 
1. 0,3,5,8,9, 11 3. No solutions 5. 0 7. 0 9. 12 


1k. at +2a?b? + 54 13. a° +2a3b3 + 5° 
17. a F ce F e. T g. F i. F 


Answers to Odd-Numbered Exercises 501 


19, 1, Det(A) = 0. 2. The column vectors of A are dependent. 

3. The row vectors of A are dependent. 4. Zero is an eigenvalue of A. 
5. A is not invertible. 
SECTION 20 

1. 3o0r5 3. Any of 3,5, 6, 7, 10, 11, 12, or 14. 5. 2 

7. g()=1 g(7) =6 g(13) = 12 g(19) = 18 g(25) = 20 
g2=1 (8) =4 (14) = 6 g(20) = 8 (26) = 12 
g3=2 yg) =6 ps5) =8 g21)=12 = g27)= 18 
g(4) =2 g(10) = 4 g(16) = 8 9(22) = 10 g(28) = 12 
g(5) =4 g(11) = 10 g(17) = 16 (23) = 22 (29) = 28 
pOH=2 gpl2=4 p(18) = 6 e(24) = 8 (30) = 8 

9 (p-Di¢-) 11. 14+4Z,3+42Z 13. No solutions 

15. No solutions 

17. 3+ 65Z, 16 + 65Z, 29 + 65Z, 42 + 65Z, 55 + 65Z 

19. 1 21. 9 

23. a. F ce. T e. T g. F i. F 

SECTION 21 

L in+@ila.g €Q} 

15. Itis isomorphic to the ring D of all rational numbers that can be expressed as a quotient of integers with denominator 
some power of 2. 

17. Itruns into trouble when we try to prove the transitive property in the proof of Lemma 5.4.2, for multiplicative cancellation 
may not hold. For R = Z, and T = {1, 2, 4} we have (1, 2) ~ (, 4) since (1)(4) = @)(2) = 4 and (2, 4) ~ (2, 1) since 
(2)(1) = (4)(2) in Ze. However, (1, 2) is not equivalent to (2, 1) because (1)(1) ¥ (2)(2) in Ze. 

SECTION 22 

1. f(x) + g(x) = 2x7 +5, fx)g(x) = 6x7 +42 +6 
3. f(x) + g(x) = 5x? +5 4-1, fg) =x + 5x 
5. 16 7. 7 9. 2 11. 0 13. 2,3 15. 0,2,4 
17. 0,1,2,3 
21. 0,x —5,2x — 10, x? — 25, x? — 5x, x4 — 5x3. (Other answers are possible.) 
23. a. T ec T e. F g. T i. T 
25. a. They are the units of D. b. i, —1 ce. 1,2,3,4,5,6 
27. b. F c. F[x] 31. a. 4,27 b. Z: x Z2, Z3 x Z3 x Zy 
SECTION 23 
lL g(x) Haxttxe tx? tx -2.r(x) = 4x43 
3. q(x) = 6x4*+ Txi 42x? —x42, r(x) =4 
53. 2,3 7. 3,10,5, 11, 14,7, 12, 6 
9 @-De+1)@ -2)@ 42) 

Wl. @-3@4+3)Qx 43) 
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Yes. It is of degree 3 with no zeros in Zs. 
oe x Det 2 


15. Partial answer: g(x) is irreducible over R, but it is not irreducible over C. 
19. Yes. p =3 21. Yes.p=5 
25. a. T eT e. T g. T i. T 
27, x? +x+1 
29, x? + 1,4? +442, x? + 2x 4-2, 2x? +2, 2x? +441, 2x? +2041 
31. p(p — 1)?/2 
SECTION 24 
1. le+0a+3b 3. 2e+2a+2b 5. j 7. (1/50)7 — (3/50)k 
9. R*, that is, {a, + Oi +07 + 0k|a,; € R, a, 40} 
ll. a F ce. F e. F g. T i. T 
ce. If |A| = 1, then End(A) = {0}. e. 0 € End(A) is not in Iso(A). 
i 0 
19% a K = 0 —i|: 


b. Denoting by B the matrix with coefficient b and by C the matrix with coefficient ¢ and the 2 x 2 identity matrix by 
7, we must check that 


Be=-I,C?=-I,K?=~-I, 
CK =B,KB=C,CB=-—K,KC=-B, and BK =—-C. 


c. We should check that ¢ is one to one. 


SECTION 25 . 


Pe ae a 


a<x<x<x3 <--- <x"... foranyae R. 

m +nv/2 is positive if m > 0 andn <0, orifm > 0 and m? > 2n?, orifn <Oand 2n? > m?. 
i. acedb ii. ecbad 

i.dabce fi. dceab i 
i. caedb ii, ecbad ; 
dbaec 13. debac | 
a. T c F e. T g. T i. F 


SECTION 26 


1. 


9. 


There are just nine possibilities: 

@(1, 0) = C1, 0) while (0, 1) = (0, 0) or (0, 1), 

(1, 0) = (0, 1) while 6(0, 1) = (0, 0) or (1, 0), 

(1, 0) = 1, 1) while @(0, 1) = (0, 0), and 

(1, 0) = (0, 0) while ¢(0, 1) = (0, 0), (1, 0), (0, 1), or 1, 1). 
(0) = {0}, Zy2/{0} ~ Zig 

(1) = {0, 1, 2, 3,4, 5, 6, 7, 8,9, 10, 11}, Zj2/(1) ~ {0} 
(2) = {0, 2,4, 6, 8, 10}, Zi. /(2) > Zp 

(3) = {0, 3; 6, 9}, Zy2/ (3) ~Z; 

(4) = {0, 4, 8}, Zi2/(4) = Zy 

(6) = {0, 6}, Zi2/ (6) ~ Ze 

Letd?:Z— Z~x Zbe given by ¢(n) = (n, 0) forn € Z. 


11. 
13. 
15. 
31. 


35. 
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R/R and R/{0} are not of real interest because R/R is the ring containing only the zero element, and R/{0} is isomorphic 
to R. 

Z is an integral domain. Z/4Z is isomorphic to Z4, which has a divisor 2 of 0. 

{(n, n)|n € Z}. (Other answers are possible.) 

The nilradical of Z, is {0, 6}. The nilradical of Z is {0} and the nilradical of Z3. is 

{0, 2,4, 6, 8,--+, 30}. 

a. Let R = Zand let N = 4Z. Then VN = 2Z 4 4Z 

b. Let R = Zand let N = 2Z. Then /N = N. 


SECTION 27 


{0, 2, 4} and {0, 3} are both prime and maximal. 

{(0, 0), (1, 0)} and {(0, 0), (0, 1)} are both prime and maximal. 

1 7, 2 9% 1,4 15. 2Z2xZ 17. 4Z x {0} 

Yes. x? — 6x + 6 is irreducible over Q by Eisenstein with p = 2. 

Yes. Zo X Zy 

No. Enlarging the domain to a field of quotients, you would have to have a field containing two different prime fields 
Z, and Z,, which is impossible. 


SECTION 28 


27. 


—3x3 + Tx? y?z — 5x? yz3 + 2x25 

2x? yz? — Ixy22* — 7x +3y+ 10z3 

2 yx — S23 yx? + Tzy?x? — 3x3 

10z? — 2z?y*x + 2z?yx? + 3y — 7x 
l<ez<y<x<P<yey<xup<xy<P<P<yl<yz<y <x2 <xyz< 
xy? <x2¢<x7y <x <-- ; 

3y2z9 — Bz? + S5y3z3 — 4x 13. 3yz3 — 8xy —4xz + 2yz + 38 

wari +e a7) 17.) GF 435-39 = 22,972? +3) 

ay Dy eo 1} 

{2x + y —5, y? — 9y + 18} 


The algebraic variety is {(1, 3), (-$. 6)}. 


ee ey eel) 
The algebraic variety consists of one point (a, —a) where a © 1.3247. 
a. T ec T e T g. T i. F 


SECTION 29 


x*>—2x—-1 3. x?-—2x4+2 

x'? 4. 3x8 — 4x6 + 3x" i +5 

Irr(a, Q) = x* — ae Sgt deg(a, Q) = 4 
Algebraic, deg(a, F) = 2 

Transcendental 

Algebraic, deg(a, F) = 2 

Algebraic, deg(a, F) = 1 


etxetl=@w—a\x+ita) 
a. T e T e. F g. F i. F 


b x4 +ls(x-a@)(e -—o%) [x —A+a+a’)] 
It is the monic polynomial in F[x] of minimal degree having a as a zero. 
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SECTION 30 


1. {(0, 1), 0, 9}, (4, D, G1, Dh {@, 1D. G, 2)}. (Other answers are possible.) 

3. No. 2(—1, 1,2) — 4(2, —3, 1) + (10, —14, 0) = (0, 0, 0) 

5. {1} 7. {1 i} 9 (1, 72, V2, (/2)5} 
15. a. T Cot ea F g. F Le 
17. a. The subspace of V generated by S is the intersection of all subspaces of V containing S. 
19. Partial answer: A basis for F” is 


{(, 0,---, 0), (0, 1,---,0),---,@,0,---, D} 


where | is the multiplicative identity of F. 
25. a. A homomorphism 
b. Partial answer: The kernel (or nullspace) of ¢ is {a € V | d(@) = 0}. 
ec. ¢ is an isomorphism of V with V’ if Ker(¢) = {0} and ¢ maps V onto V’. 


SECTION 31 


1. 2, {1,72} 3. 4, (1, V3, V2, V6} 

5. 6, (1, V2, 7/2, J2(0/2), (/2)2, V2(0/2)7} 7. 2, {1, /6} 
9. 9, (1, 72, 74, V3, 76, 4/12, V9, 18, +/36} 

11. 2, {1, V2} 13. 2, {1, 12} 

19 a F a F e F g. F i. F 

23. Partial answer: Extensions of degree 2" for n € Z* are obtained. 


SECTION 32 % 


All odd-numbered answers require proofs and are not listed here. 


SECTION 33 
1. Yes 3. Yes 5. 6 7. 0 
SECTION 34 
1. a. K = (0,3, 6,9}. 
b. O+ K = {0,3,6,9},14+K = (1,47, 10},24+ K = {2,5,8, 11}. 
ce wO+K)=0,n0+K)=2,42+K)=1. 
3. a. HN = {0,2,4,6, 8, 10, 12, 14, 16, 18, 20, 22}, HNN = {0, 12}. 
b. 0+ N = {0, 6, 12, 18},2+ N = (2,8, 14, 20},44 N = {4, 10, 16, 22}. 
c. OF (HNN) = (0, 12},44+ (ANN) = {4,16}, 8+ (ANN) = {8, 20} 
d. 60+ N)=04+(HNN), 024+N)=8+(H NN), 644+ N) =44(F NN). 
5. a. O+H = {0,4, 8, 12, 16, 20},1 + A = {1, 5,9, 13, 17, 21}, 


24H = (2,6, 10, 14, 18, 22},3+ H = {3,7, 11, 15, 19, 23}. 
O+K = (0,8, 16},1+K = {1,9, 17},2+K = (2, 10, 18}, 
3+K = (3, 11, 19}, 

4+K ={4,12,20},54+K ={5, 13,21},64+ K = (6, 14,22}, 
7+ K ={7, 15, 23}. 


a 
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c. O+ K = {0,8, 16},4+ K = (4, 12, 20}. 
d. (0+ K)+(H/K) = H/K ={0+ K,44+ K} = {{0, 8, 16}, 4, 12, 20}} 
(L+K)+(A/K) ={14+K,5+K} ={{1, 9, 17}, (5, 13, 21}} 
(2+ K)+(H/K) ={(2+K,6+ K} = {{2, 10, 18}, (6, 14, 22}} 
(34 K)+(H/K) =(34+K,7+K} = {{3, 11, 19}, {7, 15, 23}}. 
e. #0 + H) = (0+ K) +(H/K), 6. + H) = (1+ K)+(H/K). 
6(2 + H) = (2+ K)+(H/K), $3 + H) =(3+ K)+(H/R). 


SECTION 35 
1. The refinements {0} < 250Z < 10Z < Z of {0} < 10Z < Zand {0} < 250Z < 25Z < Zof0 < 25Z < Z are isomor- 
phic. 
3. The given series are isomorphic. 
5. The refinements 
{(O, 0)} < (4800Z) x Z < (240Z) x Z < (60Z) x Z < (10Z) x Z < Z x Z of the first series and 
{(0, 0)} < Z x (4800Z) < Z x (480Z) < Z x (80Z) < Z x (20Z) < Z x Zof the second series are isomorphic refine- 
ments. 
7. {0} < (16) < (8) < (4) < (2) < Zug 
{0} < (24) < (8) < (4) < (2) < Zug 
{O} < (24) < (12) < (4) < (2) < Zug 
{O} < (24) < (12) < (6) < (2) < Zig 
{O} < (24) < (12) < (6) < (3) < Zug 
9. {(po, 0)} < A3 x {0} < S3 x (0) < S3 x Zy 
{(0, 9)} < {po} x Zz < Ax x 22 < 83x Z 
{(p0, 0)} < Az x {0} < Az x Z < S3 x Zy 
Hl. {00} x Za 13. {po} x Za < {fo} x Zs < {0} x Zs < 
17. a T c« T e F g. F i. T 
i. The Jordan-Hélder theorem applied to the groups Z,, implies the Fundamental Theorem of Arithmetic. 
19. Yes. {00} < {0, 02} < {00. 01, 02; 03} < D4 is a composition (actually a principal) series and all factor groups are 
isomorphic to Z, and are thus abelian. 
21. Chain (3) Chain (4) 
{0} < (12) < (12) < (12) {0} < (12) < (12) < (6) 
S (12) s (12) < (4) < (6) < (6) = (3) 
< (2) < Zoq < Day < (3) < Zo, < Zn4 
Isomorphisms 
(12}/{0} = os {{O} ~ Zz, (12) /(12) = (6)/{6) ~ {0}, 
(12}/{12) ~ (3)/(3) ~ {0}, (12)/(12) ~ oy {12} = {0}, 
(12)/{12) ~ (6)/(6 : ~ {0}, (4)/(12) & Za4/(3) & 
(2)/(4) ~ (6)/(12) = Zo, Zag /(2) ~ (3) /(6) = 
Zn4[o4 ~ Lh oe {0} 
SECTION 36 
1. 3 3. 1,3 
5. The Sylow 3-subgroups are ((1, 2, 3)), (1,2, 4). ((1, 3. 4)), and ((2, 3, 4)). Also @G, 4)(1, 2, 3))(3, 4 = (1, 2, 4)), 


etc. 
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SECTION 37 


1. a. The conjugate classes are {9}, {2}. {01, 03}, (41, Ho}, {81, do}. 
b. 8=24242+4+2 

3. a T c. F e T g. T i. F 
e. This is somewhat a matter of opinion. 

9. 24=1464+3+4+8+46 


SECTION 38 
1, {0,1,), C0, 2, D, d, 1, 2} 


3. No. n(2, 1) + m(4, 1) can never yield an odd number for first coordinate. 
7. 2Z4< Zrankr = 1 


SECTION 39 
1 a. @Ratctb?, b’ cath a” b. a 'beatc’a), ac a+b 3a 
3. a. 16 b. 36 c. 36 
5. a. 16 b. 36 c. 18 

11. a. Partial answer: {1} is a basis for Z4. c. Yes 

13. ec. Ablop group on S is isomorphic to the free group F[S]on S. 


SECTION 40 
1. (a@:a*=1); (a,b: at =1,b =a’); (a,b,c: a =1,b* = 1, ¢ = 1). (Other answers are possible.) 
3. Octic group: 


a@b|ab|a’b| ab| b | &@ | aia 1 


Quaternion group: The same as the table for the octic group except that the 16 entries in the lower right corner are 


5. 
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a a| i 
1 aia 
a 1 |a3 | a@ 


Zn. (a,b: a7 =1,b5 =1,ba =a’b) 


SECTION 41 


1. 


3. 


a. 2P, P3 = 3P, P4 oe P| Ps = 3P2P3 + 3P)P = 5 P3P4 + 4P3 Pe = 5 PaPs 
b. No c. Yes 
C)(P) = Z;(P) = BAP) = H,(P) = Ofori > 0. By(P) = 0. Z(P) ~ Zandis generated by the 0-cycle P. Ho(P) = Z. 


Ci(X) = ZX) = BAX) = A(X) = Ofori > 0. Bo(X) ~ Zand is generated by the Q-chain P2 — P}. Z)(X)=Zx Z 
and is generated by the two 0-cycles P, and P). Since Zo(X)/Bo(X) “indentifies P; with P,,’ Ho(X) ~ Z and is 
generated by the coset P; + Bo(X). 

a. An oriented n-simplex is an ordered sequance P; P2 +++ Py +1. 

b. The boundary of P, P,--- P,; is given by 


n+l 


On(P, Pa--- Pai) ) (HAE PL Pye Prat Pits Pra 
i=l 


c. Each individual summand of the boundary of an oriented n-simplex is a face of the simplex. 


11. a. so( Sma = Sid (03) 
13. A(X) = Z(X)/BO(X*) 
H(S) ~ Z and is generated by (P; + P2 + P; + Ps) + {0} 
HS) =0 
H®(S) ~ Zand is generated by P; P2P3 + BS) 
SECTION 42 
1. H(X) 2 Z. A(X) ~ Zx Z. AX) = Oforn > 1. 
3. Ho(X)~@Z x Z. W(X) = @. HX) = Z. 0, (X) = 0 forn > 2. 
5. HX) ~ Z. A(X) ~ Z. HX) ~ Z. A(X) = Oforn > 2. 
7. a. T a F e T g. T i. F 
9. H(X)~Z. W(X) ~ZxZx Z. A(X) =~ Zx Z. H,(X) = 0 forn > 2. 
VW. Ap(X)~ Z.A(X)2 2x @x Z@ x Z. Ay(X) ~ Z. H,(X) = Oforn > 2. 
SECTION 43 
1. Both counts show that X(X) = 1. 
3. It will hold for a square region, for such a region is homeomorphic to E?. It obviously does not hold for two disjoint 
2-cells, for each can be mapped continuously onto the other, and such a map has no fixed points. 
5. H(X)~Zx Z.A(X)2~ZxKZxZ x Zo. A(X) = OVtora > 1. 


2—2n 
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9. 


11. 


13. 
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A(X)2Z. A(X)~2xZx---x 2x. A,(X) =O forn > 1. 
(¢—1) factors 
Let Q be a vertex of b, and let c be the 2-chain consisting of all 2-simplexes of X, all oriented the same way, so that 
c € Z,(X). 
a. fio is given by f.o(Q + Bo(X)) = O + Bold). 
Fxi is given by fai(Qna + nb) + By(X)) = nb + Bib). 
Jo is given by fao(e + By(X)) = 0. 
b. fio is as in (a). 
fui is given by f,:((@ma + nb) + B,(X)) = 2nb + By(b). 
Fag is as in (a). 
Let Q be a vertex on bd. 
fao is given by f.o(Q + Bo(X)) = Q + Bold). 
Fis is given by f,1(ma + nb) + B,(X)) = nb + Bi (bd), where m = 0, 1. 
fez is trivial, since both H,(X) and H,(b) are 0. 


SECTION 44 


5. 


9. 
11. 


For Theorem 44.4, the condition f,_10, = 9; f; implies that 
fe-1(By-1(A)) & By-1(A’). 
Then Exercise 14.39 shows that f,_; induces a natural homomorphism of Z;,_1(A)/B,_1(A) into Z,_1(A)/ By_1(A)). 
This is the correct way to view Theorem 44.4. 
For Theorem 44.7, if we use Exercise 14.39, the fact that 0,(A;,) © Aj;_, shows that 4; induces a natural homomorphism 
Ox (Ag / AL) > (Ag-i/Ai_)- 
The exact homology sequence is 
* ix ha 842 
[H,(a) = 0] > [H,(X) ~ Z] 4 [H,(X, a) ~ Z] > [Ai (a) ~ Z] 
y [Hy(X) © Z x Z) 2s [H(X, a) © Z] s [o(a) © ZI 
—*s [Ho(X) ~ Z] 43 [Hy(X, a) = 0]. 
Jx2 Maps a generator c + B,(X) of H)(X) onto the generator 
(c + Ca(a)) + Bo(X, a) 
of H,(X, a) and is an isomorphism. Thus (Kernel j,2) = (image i...) = 0. 
0,2 aps everything onto 0, so (Kernal 0,2.) = (image j,) = Z. 
i,, maps the generator a+ B)(a) onto (a+ 0b) + B,(X), so i,, is an isomorphism into, and (kernal i,,) = 
(image 0,2) = 0. 
jx. Maps (ma + nb) + B,(X) onto (nb + C\(a)) + B(X, a), so (kernal j,,) = (image i,,) ~ Z. 
0,1 maps (nb + C,(a)) + B,(X, a) onto 0, so (kernal 0,,) = (image j,;) ~ Z. 
For a vertex Q of a,isg maps Q+ Bo(a) onto @+ Bo{X), so ing is an isomorphism, and (kernal i,o) = 
(image 0,;) = 0. 
jxo maps QO + Bo(X) onto Bo(X, a) in Hy(X, a), so (kernal j,9) = (image io) ~ Z. 
The answer is formally identical with that in Exercise 44.7. 


Partial answer: The exact homology sequence is 
[Hy(Y) = 0] > [H,(X) = 0) 2 [ay(X, ¥) ~ Z) 3 (HY) ~ Zx ZI 
del 


LAX) ~ Z) 2 (A(X, Y) ~ Z) 28> [Ho(Y) x Z x Z] 
®, [Hy(X) ~ Z] 2 (Ho(X, Y) = 01. 
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The verification of exactness is left to you. Note that the edge P, Q; of Fig. 42.11 gives rise to a generator of H)(X, Y). 
Starting with 3,2, these maps are very interesting. 


SECTION 45 


1. Yes 3. No 5. No. 7, Yes 
9. In Z[x]: only 2x —7,-2x +7 
7 
In Q[x] : 4x — 14, x - 7 6x — 21, —8x + 28 
In Zy1 [x] : 2x — 7, 10x — 2, 6x + 1,3x —5, 5x -1 


11. 26, —26 13. 198, —198 

15. Itis already “primitive” because every nonzero element of Q is a unit. Indeed 18ax? — 12ax + 48a is primitive for all 
aeQ,aF0. 

17. 2ax? — 3ax + 6a is primitive for all a 4 0 in Z, because every such element a is a unit in Zy. 

21. a T c T e. T g. F i. F 


i. Either p or one of its associates must appear in every factorization into irreducibles. 
23, 2x +4 is irreducible in Q[x] but not in Z[x]. 
31. Partial answer: x3 — y> = (x — y)\(x? +xy + y’) 


SECTION 46 
1. Yes 3. No. (1) is violated. 5. Yes 
7. 61 9. xe +2x-1 11. 66 
13. a T e« T e. T g. T i. T 
23. Partial answer: The equation ax = b has a solution in Z, for nonzero a, b € Z,, if and only if the positive gcd of a and 
n in Z divides b. * 
SECTION 47 
1, 5=(14+ 211 — 2%) 3. 443) =(4+2i)(2 -7) 
5. 6 = (2)(3) = (-1 + V=3\(-1 - V-35) 7 74 
15. c. i) order 9, characteristic 3 id) order 2, characteristic 2 


iii) order 5, characteristic 5 


SECTION 48 


. f2,-72 3. 34-V72,3-V2 5. V2+i,V2-i,-V2 +i, -V2-i 
7. Vibe R sl aA Vi 9. 3 
1. —V¥2+375 13. —J/2+ V45 
15. a Q b. Qv6) c. Q 
17. QW2, V3, V5) 19. Q(V3, V10) 21. Q 
25, a. 3-— V2 b. They are the same maps. 

27. 03;(0) = 0, 03(1) = 1, 03(2) = 2, o3(@) = —a, 0320) = —2a, 
o73(lt+a)=1-—¢,0,4 + 2a) = 1-—2a,0;2+a)=2—a, 
o3(2 + 2a) =2- 2a; Z3(@)to3} = Ze 

29. a. F aT ae F g. T i. T 

37. Yes 
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SECTION 49 


1. The identity map of E onto E; 
t given by t(/2) = V2, (V3) = —V3, tS) = —V5 
3. t given by 1)(/2) = V2, 1(V3) = V3, (V5) = —V5; 
T given by (72) = /2, (V3) = —V3, (75) = V5; 
T3 given by t3(/2) = —V2, 3(V3) = V3, (V5) = V5; 
t given by t4(/2) = —V2, t4(V3) = —V3, t4(/5) = —V5S 
5. The identity map of Q(¥/2, V3) into itself; 
qT, given by t)(a,) = a, 11(/3) = —/3 where a, = V2; 
T2 given by t2(a) = a2, 19(/3) = V3 where a = /2(-1 + i/3)/2: 
T3 given by 13(a1) = a2, 3(V3) = —V3; 
Ts given by T4(o)) = 3, T4(/3) = V3 where a's = S/2(-1 — 1/3)/2; 
Ts given by t;(a@1) = a3, t5(¥3) = —V3; 


7. a. Q(x”) b. t given by 1)(./7) =i,/m, t given by (4/7) = —i./7r 
SECTION 50 
Ae. 32. 3. 4 5;..2 7 1 9 2 13. 1<[E:F]<n! 


15. Let F = Qand E = Q(/2). Then 
fx) = x4 —5x? +6 = (x7 —2)(x? - 3) 


has a zero in E, but does not split in £. 
23. a. 6 


SECTION 51 


a=VY2Q= 2/(/2V2)./S2 = (V2), 72 = (4/2). (Other answers are possible.) 
3% a=VJV24+73.J/2= Ge _ Ca, V3 = (Ge _ Se. (Other answers are possible.) 


fx) =x4 - 407 +4 = (0? 2y. Here f(x) is not an irreducible polynomial. Every irreducible factor of f(x) has 
zeros of multiplicty 1 only. 


15. b. The field F ec. F[x?] 

SECTION 52 

1. Z3(y?, 2°) 3. Zs(y*, z’) 

5. a F a F e F g. T i, T 
SECTION 53 

1. 8 3. 8 5. 4 Te 2. 


9. The group has two elements, the identity automorphism ¢ of Q(i) and o such that o(i) = —i. 
—1+iv3 sel —iv3 
11. a. Leta, = /2, 2 = pes, and a3 = ygrtciv3: 


2 
The maps are 
fo, where po is the identity map; 
fi, Where p;(a,) = a> and p, G73) = i/3; 
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po, where p2(a) = a3 and py(i-V3) = i V3; 

jay, where j11(o1,) = at, and pi (V3) = —i1/3; 
Ja, Where f(a) = a3 and p(iV3) = —i/3; 
43, Where j13(01) = a and p3(iV3) = -iV3. 


b. S3. The notation in (a) was chosen to coincide with the notation for S; in Example 8.7. 


{00s P1» Pat (Pi. Hil {Po Ha} {Po 3} 
{Pol 
Group diagram 
a i i 
QI = Kip, oo) VID =Kip, uw) QO.) = Kip) WO) = Kip, us 
Og a o a 
Field diagram 


13. The splitting field of (x? 1) € Qlx] is QG-V3), and the group is cyclic of order 2 with elements: 2, where ¢ is the 
identity map of Q(i/3), and o, where o G-V3) = —i-V3. 

15. a. F ec. T e. T g. F i. F 

25, Partial answer: G(K /(E V L)) = G(K/E) G(K/L) 


SECTION 54 


3. OY? i): JD H+ i, x8 + 4x6 + 2x4 + 28x? 4-1: 
Q/2): 72, x4 - 2; 
QG/2)): 1/2), x* — 2; 
QW2, i): V2 +i, xt — 2x? +9; 
OW/2 + i(12)): V2 + i(/2), x4 + 8; 
QW? — i(W/2)): 12 — i(/2), x* +8; 
QV2): V2, x? — 2; 
Q@): i, x? +1; 
QGV2): iJ/2, x? +2; 
Q:i,x-1 


5. The group is cyclic of order 5, and its elements are 


| L | Oy | 02 | 03 | O4 


>| 2 | ww) ) 20/2 | 68/2) | 4/2) 


where </2 is the real 5th root of 2. 
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7. The splitting field of x® — 1 over Q is the same as the splitting field of x+ + 1 over Q, so a complete description is 
contained in Example 54.7. (This is the easiest way to answer the problem.) 


9. a. 5)? — 25> b. so 3s 
SECTION 55 
3. a. 16 b. 400 ec. 2160 
5, 3° 
7. ©3(x) over Zo isx?+x+1. 
@ (x) over Z3 is xt + 1 = (x? +x + 2)(x? + 2x + 2). 
9, a T ce F e T g. 7 i. F 
11. @,(x) =—x-1l 
Oxy) =x4+1 
®3(x) =x? +x4+1 
Ox) =x? +1 
@5(x) =xt +a tx? +x41 
O.(x) =x* -—x41 
SECTION 56 
1. No. Yes, K is an extension of Z, by radicals. 
3. aT ce T e. T g. T i. F (x? — 2x over Q gives a counterexample.) 
APPENDIX & 
; 3 i 
Dyin Mt 
0 -i 
5 16 -3 1 -i 
E —18 | # Ree oe 


8 8 0 -1 
ff] ou Pap a 


poe Streetcar aren nahn 
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Abel, Niels Henrik, 39, 174, 324, 471 
Abelian extension, 455 
Abelian group(s), 39 

free, 334 

structure theorem for 

finitely-generated, 108 

torsion free, 113, 142 
Absolute value, 13 
Action 

faithfull, 155 

on a group, 154 

transitive, 86, 155 
Addition 

modulo 27, 16 

modulo n 18, 64 
Al-Banna, Abu-l-’ Abbas ibn 77 
Al-Tuse Sharaf al-Din, 206 
Algebra 

fundamental theorem of, 254, 288 

group, 223 

homological, 380 
Algebraic closure, 287, 288 
Algebraic closure of F in E, 286 
Algebraic element over F’, 267 
Algebraic extension, 283 
Algebraic homotopy, 388 
Algebraic integer, 463 
Algebraic number, 268 
Algebraic property, 16 
Algebraic variety, 255 
Algebraically closed field, 287, 292 
Alphabet, 341 
Alternating group on n letters, 91 
Annulus, 367 


Antisymmetric law, 288 
Arc of a diagraph, 70 
Archimedian ordering, 230 
Arithmetic, fundamental theorem 
of, 395 
Artin, Emil, 207, 419 
Ascending central series, 318 
Ascending chain condition, 
392, 401 

Aschbacher, Michael, 149 
Associates, 389 
Associative operation, 23, 37 
Automorphism. 

of a field, 416 

of a group, 66, 141 

fixed field of, 418, 419 

Frobenius, 421 

inner, 141 

of a ring, 232 
Axiom of choice, 288, 289 
Axis of reflection, 114 


Ball, 364 
Banach, Stefan, 275 
Basis 
for a finitely-generated abelian 
group, 345 
for a free abelian group, 334 
Grobner, 261 
for an ideal, 255 
for a vector space, 278 
Bessy, Bernard Frenicle de, 185 
Betti number, 109 
Bijection, 4 


Binary algebraic structure(s), 29 
isomorphic, 30 
structural property of, 31 
Binary operation, 11, 20 
Bloom, David M., 91 
Boolean ring, 177 
Boundary homomorphism, 358 
Boundary of a simplex, 357 
Bourbaki, Nicholas, 4, 289, 345 
Brahmagupta, 403 
Brouwer fixed-point theorem, 376 
Burnside, William, 149, 330 
Burnside’s formula, 161 


Cancellation laws, 41, 178 
Cardano, Girolamo, 206, 471 
Cardinality, 4, 5 
Cartesian product, 3, 104 
Cauchy Augustin-Louis, 77 
Cauchy’s theorem, 322 
Cayley, Arthur, 70, 81, 347 
Cayley digraph, 70 
Cayley’s theorem, 82 
Cell, 6 
n-, 364 
Center of a group, 58, 150, 318 
Centroid, 115 
Chain(s), 288, 358 
Chain complex, 380 
subcomplex of, 381 
Chain condition, 
ascending, 392, 401 
descending, 401 
Characteristic of a ring, 181 
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Chief series, 315 
Class 
conjugate, 328 
equivalence, 8 
homology, 367 
residue modulo H, 137 
residue modulo n, 6 
Class equation, 328 
Closed interval, 9 
Closed set under an operation, 15, 21 
Closed surface, 371 
Closure 
algebraic, 286, 287 
in an ordering, 228 
separable of F in E, 443, 446 
totally inseparable of F in E, 447 
Closure condition, 15, 21, 228 
Coboundary, 363 
Cochain, 363 
Cocycle, 363 
Codomain, 4 
Coefficients 
of a polynomial, 199 
torsion, 113 
Cohomology group, 363 
Column vector, 478 
Commensurable numbers, 205 
Commutative operation, 23 
Commutative ring, 172 
Commutator, 143, 150 
Commutator subgroup, 143, 150: 
Comparable elements, 288 
Complex number, 3, 12 
absolute value of, 13 
conjugate of, 416 
Complex, simplicial, 358 
Composition, function, 22, 23 
associativity of, 23 
Composition series, 315 
Congruence 
modulo H, 137 
modulo n, 7 
Conjugate class, 328 
Conjugate complex numbers, 416 
Conjugate elements over F, 416 
Conjugate subgroups, 141, 143 
Conjugation, 141 
Conjugation isomorphism, 416 
Connected component, 365 
Connected space, 365 
Consequence, 348 
Constant polynomial, 199 
Constructible number, 293 
Constructible polygon, 466 
Content of a polynomial, 396 
Continuous function, 377 
Contractible space, 365 


Contraction, elementary, 341 
Correspondence, one-to-one, 4 
Coset, 97 
double, 103 
left, 97 
right, 97 
Coset group, 137 
Crelle, August, 39 
Cross cap, 378 
Cycle(s), 89, 359, 380 
disjoint, 89 
homologous, 367 
length of, 89 
Cyclic extension, 456 
Cyclic group, 54, 59 
Cyclic subgroup, 54, 59 
Cyclotomic extension, 464 
Cyclotomic polynomial, 217, 465 


Decomposable group, 109 
Dedekind, Richard, 174, 241, 419 
Definitions, 1 
Degree 

of a over F’, 269 

of an extension, 283 

of a polynomial, 199 
Derivative of a polynomial, 443 
Descartes, Rene, 198 
Descending chain condition, 401 
Determinant of a square matrix, 46, 

479, 480 

Diagonal matrix, 46 
Digraph, 70 

arc of, 70 

vertex of, 70 
Dihedral group, 79, 86 
Dimension of a vector space over F, 

280 

Direct product, 105 

external, 108 

internal, 108 

of rings, 169 
Direct sum, 105 

of vector spaces, 281 
Dirichlet, Peter Lejeune, 174 
Discrete frieze group, 116 
Discriminant of a polynomial, 463 
Disjoint cycles, 89 
Disjoint sets, 6 
Disjoint union of G-sets, 160 
Distributive law, 167 
Division algorithm 

for Z, 60 

for F[x], 210, 220 
Division ring, 173 
Divisor, 256, 389 

greatest common, 62, 395 


of a polynomial, 217 
of zero, 178 
Domain 
Euclidean, 401 
of a function, 4 
integral, 179 
principal ideal, 391 
unique factorization, 390 
Dot product, 478 
Double coset, 103 
Doubling the cube, 297 


Eisenstein criterion, 215 
Element(s), 1 
algebraic over F’, 267 
comparable, 288 
conjugate over F’, 416 
fixed, 418 
idempotent, 28, 48, 176, 183 
identity, 32, 38 
independent transcendental, 473 
inverse of, 38 
irreducible, 389 
maximal, 288 
nilpotent, 176, 245 
orbit of, 84, 87, 158 
order of, 59 
positive, 228 
prime, 394 
primitive, 441 
separable over F, 438 
totally inseparable over F, 444 
transcendental over F, 267 
Elementary contraction, 341 
Elementary symmetric function, 
457 
Empty set, 1 
Empty word, 341 
Endomorphism, 220 
Equality relation, 13 
Equation, class, 328 
Equivalence class, 8 
Equivalence relation, 7 
Escher M. C., 118 
Euclid, 185, 403 
Euclidean algorithm, 404 
Euclidean domain, 401 
Euclidean norm, 401 
Euler, Leonard, 13, 39, 186, 468 
Euler characteristic, 374 
Euler formula, 13 
Euler phi-function, 104, 187 
Euler’s theorem, 187 
Evaluation homomorphism, 126, 
171, 201 
Even permutation, 92 
Exact sequence, 385 


Exact homology sequence of a pair, 
386 

Extension(s), 265 

abelian, 455 

algebraic, 283 

cyclic, 456 

cyclotomic, 464 

degree of, 283 

finite, 283 

finite normal, 448 

index of, 428 

join of, 456 

of a map, 425 

by radicals, 470 

separable, 438, 443 

simple, 270 

totally inseparable, 444 
Extension field, 265 
External direct product, 108 


Face of a simplex, 357 

Factor, 256, 389 
of a polynomial, 256 

Factor group, 137, 139 

Factor ring, 242 

Factor theorem, 211 

Faithfull action, 155 

Feit, Walter, 149, 330 

Fermat, Pierre de, 185 

Fermat prime, 468 

Fermat’s last theorem, 390 * 

Fermat’s p = a? +b’ theorem, 411 

Fermat’s theorem, 184 

Ferrari, Lodovico, 471 

Ferro, Scipione del, 471 

Field, 173 
algebraic closure of, 287,288 
algebraic closure in E, 286 
algebraically closed, 287, 292 
automorphism of, 418 
extension of, 265 
fixed, 418, 419 
formal Laurent series, 231 
formal power series, 230 
Galois, 300 
perfect, 440 
prime, 250 
quotient in a, 179 
of quotients, 194, 
of rational functions, 201 
separable closure in E, 443, 446 
separable extension of, 438, 443 
simple extension of, 270 
skew, 173 
splitting, 432 
strictly skew, 173 
subfield of, 173 


Field extension, 265 
simple, 270 
Finite-basis condition, 401 
Finite basis for an ideal, 256 
Finite extension, 283 
degree of, 283 
Finite group, 43 
Finite presentation, 348 
Finite-dimensional vector space, 277 
Finitely-generated group, 69 
Fixed elements, 418 
Fixed field, 418, 419 
Fixed point, 119, 376 
Fixed subfield, 418 
Formal Laurent series, 231 
Formal power series, 230 
Free abelian group, 334 
basis for, 334 
rank of, 336 
Free generators, 342 
Free group, 342 
rank of, 342 
Frey, Gerhard, 390 
Frieze group, 116 
Frobenius, Georg, 324 
Frobenius automorphism, 421 
Frobenius homomorphism, 244 
Frobenius substitution, 421 
Function(s), 4 
codomain of, 4 
composite, 22, 23 
composition of, 22, 23 
continuous, 377 
domain of, 4 
elementary symmetric, 457 
Euler phi-, 104, 187 
image of A under, 82, 128 
inverse of, 5 
inverse image under, 128 
one-to-one, 4 
onto, 4 
phi-, 104, 187 
polynomial on F, 209 
range of, 4, 128 
rational, 201 
restricted, 308 
symmetric, 457 
two-to-two, 10 
Fundamental homomorphism 
theorem, 140, 242 
Fundamental theorem of algebra, 
254, 288 
Fundamental theorem of arithmetic, 
395 
Fundamental theorem of 
finitely-generated abelian 
groups, 108, 338 
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G-set(s), 154 
disjoint union of, 160 
isomorphic, 159 
orbits of, 158 
sub-, 159 
transitive, 155 
union of, 204 
Gallian, Joseph A., 118 
Galois, Evariste, 132, 174, 302, 
317, 464 
Galois field, 300 
Galois group, 451 
Gauss, Carl F., 38, 108, 298, 302, 
408, 464 
Gauss’s lemma, 396 
Gaussian integer, 196, 407 
General linear group, 40 ; 
General polynomial of degree n, 457 
Generating set, 68, 69 
Generator(s), 54, 59, 68, 69 
for a presentation, 348 
free, 342 
of a group, 54, 59, 
of a principal ideal, 250, 339 
relation on, 73, 348 
for a vector space, 276 
Genus, 379 
Glide reflection, 114 
nontrivial, 116 
Grassmann, Hermann, 275 
Greatest common divisor, 62, 395 
Griess, Robert L. Jr., 149 
Grébner basis, 261 
Group(s), 37 
abelian, 39 
alternating on n letters, 93 
ascending central series of, 318 
automorphism of, 66, 141 
of automorphisms, 420 
center of, 58, 150, 318 
cohomology, 363 
commutator in a, 143, 
of cosets, 137 
eyclic, 54, 59 
decomposable, 109 
dihedral, 79, 86 
direct product of, 105 
direct sum of, 105 
discrete frieze, 116 
endomorphism of, 220 
factor, 137, 139 
finite, 43 
finitely-generated, 69 
free, 342 
free on A, 342 
free abelian, 334 
frieze, 116 
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Group(s) (cont.) 
Galois, 451 
general linear, 40 
generator(s) of, 54, 59, 68, 69 
homology, 361, 380 
homomorphism of, 125 
indecomposable, 109 
inner automorphism of, 141 
isomorphic, 45, 
Klein 4-, 51 
of n-boundaries, 359, 380 
of n-chains, 358, 380 
of n-cycles, 359, 380 
octic, 79, 352 
order of, 50 
p-, 322 
plane crystallographic, 117 
of a polynomial over F, 452 
presentation of, 347, 348 
quaternion, 352 
quotient, 139 
regular representation of, 83 
relative homology, 383 
series of, 311 
simple, 149 
solvable, 317 
subgroup of, 50 
symmetric on # letters, 78 
of symmetries, 79, 114 
torsion, 142 
torsion free, 142 & 
wallpaper, 117 

Group action, 154 

Group algebra, 223 

Group ring, 223 

Group table, 43 


Half-open interval, 15 
Hamilton, Sir William Rowan, 
224, 275 
Hilbert, David, 168 
Hilbert basis theorem, 256 
Holder, Otto, 317, 347 
Homeomorphic spaces, 355 
Homeomorphism, 355 
Homological algebra, 380 
Homology class, 367 
Homology group, 361, 380 
relative, 383 
invariance property of, 364 
Homomorphism, 30, 125, 171 
boundary, 358 
coboundary, 363 
evaluation, 126, 171, 201 
Frobenius, 244 
fundamental theorem for, 140, 242 
kemel of, 129, 171, 238 


projection, 127, 237 

of aring, 171, 237 

trivial, 126 
Homomorphism property, 29, 30, 125 
Homotopy, algebraic, 388 


Ideal(s), 241 
ascending chain condition for, 
392, 401 
basis for, 255 
descending chain condition for, 401 
finite basis for, 256 
finite-basis condition for, 401 
improper, 246 
left, 254 
maximal, 247 
maximum condition for, 401 
minimum condition for, 401 
nilradical of, 245 
prime, 248 
principal, 250 
product of, 254 
quotient of, 254 
radical of, 245 
right, 254 
sum of, 254 
trivial, 246 
Idempotent element, 28, 48, 
176, 183 
Identity element, 32, 38 
left, 35, 43 
right, 35 
Image 
of A, 82, 128 
inverse, 128 
under a map, 82, 128 
Imaginary number, 12 
Improper ideal, 246 
Improper subgroup, 57 
Improper subset, 1 
Indecomposable group, 109 
Independent transcendental 
elements, 473 
Indeterminate, 198 
Index 
of E over F, 428 
of a subgroup, 101 
Induced operation, 21 
Induced ordering, 228, 231 
Infinite order, 59 
Infinite set, 5 
Injection, 4, 133 
Injection map, 4, 133, 194 
Inner automorphism, 141 
Integer(s), 3 
algebraic, 463 
Gaussian, 196, 407 


rational, 408 
relatively prime, 62, 374 
Integral domain, 179 
associates in, 389 
Euclidean norm on, 401 
prime element of, 394 
field of quotients of, 194 
unit in, 389 
Internal direct product, 108 
Intersection, 59, 69 
Interval 
closed, 9 
half open, 15 
Invariant series, 311 
Invariant subgroup, 141 
Inverse 
of an element, 38 
left, 43 
multiplicative, 173 
of a matrix, 479 
Inverse function, 5 
Inverse map, 5 
Inverse image under a map, 128 
Invertible matrix, 479 
Irreducible element, 389 
Irreducible polynomial, 214 
for a over F, 269 
in F [x], 214 
Isometry, 114 
Isomorphic binary structures, 30 
Isomorphic G-sets, 159 
Isomorphic groups, 45 
Isomorphic presentations, 348 
Isomorphic rings, 172 
Isomorphic series, 312 
Isomorphism, 16 
of a binary structure, 29 
conjugation, 416 
of a G-set, 159 
of a group, 45, 132 
of a ring, 172 
of a vector space, 282 
Isomorphism extension theorem, 
425, 428 
Isomorphism theorems, 307-309 
Isotonicity, 229 
Isotropy subgroup, 157 


Join 

of extension fields, 456 

of subgroups, 308 
Jordan, Camille, 39, 132, 317 
Jordan-Hélder theorem, 316 


Kernel, 129, 171, 238 
of a linear transformation, 282 
Khayyam, Omar, 206 


Klein bottle, 371 

pinched, 388 
Klein 4-group, 51 
Kronecker, Leopold, 108, 174, 266 
Kronecker’s theorem, 266 
Kummer, Ernst, 108, 241, 390 


Lagrange, Joseph-Louis, 38, 77, 96, 
100, 471 
theorem of, 100, 146 
Lame, Gabriel, 390 
Laurent series, formal, 231 
Law 
antisymmetric, 288 
cancellation, 41, 178 
distributive, 167 
reflexive, 288 
transitive, 288 
Least common multiple, 67, 
107, 407 
Left cancellation law, 41 
Left coset, 97 
Left ideal, 254 
Left identity, 35, 43 
Left inverse, 43 
Left regular representation, 83 
Length of a cycle, 89 
Letter, 341 
Levi ben Gerson, 77 
Levinson, Norman, 304 
Lexicographical order, 260 
Lindemann, Ferdinand, 298 
Linear combination, 276 
Linear transformation, 127, 282 
kernel of, 282 
Linearly dependent vectors over F,, 
277 
Linearly independent vectors over F, 
277 
Liouville, Joseph, 390 


Main diagonal of a matrix, 46, 480 
Main theorem of Galois theory, 451 
Map, 4 

extension of, 425 

injection, 41, 133, 194 

inverse of, 4 

projection, 127 

range of, 4, 128 

restricted, 308 
Matrix, 477 

determinant of, 46, 479, 480 

diagonal, 46 

inverse of, 479 

invertible, 479 

main diagonal of, 46, 480 

orthogonal, 55 


permutation, 87 

product of, 478 

singular, 479 

square, 477 

sum of, 477 

trace of, 133 

transpose of, 55 

upper-triangular, 46 
Matrix representation, 36 
Maximal element, 288 
Maximal ideal, 247 
Maximal normal subgroup, 149 
Maximum condition, 401 
McKay, J. H., 322 
Mersenne prime, 185 
Minimal polynomial for a over F’, 
273 
Minimal subset, 53 
Minimum condition, 401 
Mobius strip, 372, 373 
Monic polynomial, 269 
Monoid, 42 
Multiple, least common, 67, 
107, 407 
Multiplication 
by components, 104, 105 
modulo n, 169 
permutation, 76 
Multiplicative inverse, 173 
Multiplicative norm, 410 
Multiplicity of a zero, 436 


n-ball, 364 
n-boundary, 359, 380 
n-cell, 364 
n-chain, 358 
n-cycle, 359, 380 
n-sphere, 364 
Nilpotent element, 176, 245 
Nilradical, 245 
Noether, Emmy, 168, 419 
Nontrivial subgroup, 61 
Norm 
Euclidean, 401 
multiplicative, 410 
over F, 455 
Normal extension, finite, 448 
Normal series, 311 
Normal subgroup, 132, 141 
maximal, 149 
Normalizer of a subgroup, 323 
Nullstellensatz, 254 
Number(s) 
algebraic, 268 
betti, 109 
commensurable, 205 
complex, 3, 12 
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constructible, 293 

imaginary, 12 

rational, 3 

real, 3 

transcendental, 268 
Nunke, R. J., 322 


Octic group, 79, 352 
Odd permutation, 92 
One-to-one correspondence, 4 
One-to-one function, 4 
One-sided surface, 371 
Onto function, 4 
Operation 
associative, 23, 37 
binary, 11, 20 
commutative, 23 
induced, 21 
well-defined, 25 
Orbit, 84, 87, 158 
Order 
of a group, 50 
of an element, 59 
infinite, 59 
term, 260 
Ordered ring, 228 
Ordering 
Archimedian, 230 
induced, 228, 231 
lexicographical, 260 
natural, 228 
partial, 288 
of power products, 259 
of a ring, 228 
Orientation, 114, 356 
Oriented n-simplex, 356 
Orthogonal matrix, 55 


p-group, 322 
p-subgroup, 322 
Partial ordering, 288 
Partition, 6 

cells of, 6 

of n, 333 
Pattern, periodic, 117 
Peano, Giuseppe, 275 
Perfect field, 440 
Periodic pattern, 117 
Permutation, 76 

even, 92 

multiplication, 76 

odd, 92 

orbits of, 84, 87 

sign of, 135 
Permutation matrix, 87 
Phi-function, 104, 187 
Plane, translation of, 114 
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Plane crystallographic group, 117 
Plane isometry, 114 
Point, fixed, 119, 376 
Polygon, constructible, 466 
Polynomial(s), 199 
coefficients of, 199 
constant, 199 
content of, 396 
cyclotomic, 217, 465 
degree of, 199 
derivative of, 443 
discriminant of, 463 
divisor of, 217, 256 
Eisenstein, 215 
factor of, 256 
general of degree n, 457 
group over F of, 452 
irreducible for w over F, 269 
irreducible over F, 214 
irreducible, 214 
minimal for a over F, 273 
monic, 269 
primitive, 396 
reducible, 214 
separable over F’, 438 
solvable by radicals over F, 470 
splitting field of, 432 
term ordering of, 260 
zero of, 204, 255 
Polynomial function on F’, 209 
Positive element, 228 * 
Power product, 259 
ordering of, 259 
Power series, formal, 230 
Power set, 8 
Presentation, 347, 348 
finite, 348 
generators for, 348 
isomorphic, 348 
Prime, 394 
Fermat, 468 
Mersenne, 185 
Prime field, 250 
Prime ideal, 248 
Primitive element, 441 
Primitive element theorem, 441 
Primitive nth root of unity, 
67, 301 
Primitive polynomial, 396 
Principal ideal, 250 
generator of, 250 
Principal ideal domain, 391 
Principal series, 315 
Product 
Cartesian, 3, 104 
direct, 105, 169 
of ideals, 254 


of matrices,.478 

power, 259 
Projection homomorphism, 127, 237 
Projection map, 127 
Projective plane, 
Proper subgroup, 51 
Proper subset, 1 
Property 

algebraic, 16 

structural, 11, 31 
Pythagorean theorem, 205 


Qin Jiushao, 403 
Quaternion group, 352 
Quaternions, 224 
Quotient, 
in the division algorithm, 60 
in a field, 179 
of ideals, 254 
Quotient group, 139 
Quotient ring, 242 
Quotient space, 282 


Rabin, Michael, 348 
Radical(s) 
extension by, 470 
of an ideal, 245 
Range of a map, 4, 128 
Rank, 336, 342 
Rational function, 201 
Rational integer, 408 
Rational number, 3 
Real number, 3 
Reduced word, 341 
Reducible polynomial, 214 
Reduction modulo n, 127 
Refinement of a series, 311 
Reflection, 114 
axis of, 114 
glide, 114 
Reflexive law, 288 
Reflexive relation, 7 
Regular representation, 83 
left, 83 
right, 83 
Relation(s), 3, 73, 348 
consequence of, 348 
equality, 3 
equivalence, 7 
reflexive, 7 
symmetric, 7 
transitive, 7 
Relative homology group, 383 
Relatively prime, 62, 374 
Relator, 348 
Remainder in the division 
algorithm, 60, 210, 220 


Repyesentation 


left regular, 83 
matrix, 36 
right regular, 83 


Residue class 


modulo H, 137 
modulo n, 6 
Restricted map, 308 
Ribet, Ken, 390 
Right cancellation law, 41 
Right coset, 97 
Right ideal, 254 
Right identity, 35 


Right regular representation, 83 


Ring(s), 167 
additive group of, 168 
automorphism of, 232 
Boolean, 177 
characteristic of, 181 
commutative, 172 
division, 173 
of endomorphisms, 220 
factor, 241 
formal power series, 230 
group, 223 
homomorphism, 171, 237 
ideal of, 242 
isomorphic, 172 
isomorphism of, 172 
maximal ideal of, 247 
nilradical of, 245 
ordered, 228 
of polynomials, 200, 201 
prime ideal of, 248 
quotient, 241 
radical of, 245 
simple, 253 
subring of, 173 
unit in a, 173, 389 
with unity, 172 
zero, 172 

Roots of unity, 18 
nth, 18 
primitive mth, 67, 301 

Rotation, 114 

Row vector, 478 

Ruffini, Paolo, 471 


Scalar, 275 

Schreier theorem, 314 

Sefer Yetsirah, 77 

Semigroup, 42 

Separable closure of F in E, 
443, 446 


Separable element over F, 438 


Separable extension, 438, 443 


Separable polynomial over F’, 438 


Sequence of groups 
exact, 385 
exact homology, 386 
Series 
ascending central, 318 
chief, 315 
composition, 315 
formal Laurent, 231 
formal power, 230 
invariant, 311 
isomorphic, 312 
normal, 311 
principal, 315 
refinement of, 311 
subnormal, 311 
Set(s), 1 
binary operation on, 20 
cardinality of, 4 
Cartesian product of, 3, 104 
closed under an operation, 
21, 35, 
disjoint, 6 
element of, 1 
empty, 1 
G-, 154 
generating, 68, 69 
infinite, 5 
intersection of, 59, 69 
partial ordering of, 288 
partition of, 6 
permutation of, 76 
power, 8 
subset of, 1 
union of, 391 
well-defined, 1 
Shimura, Goro, 390 
Sign of a permutation, 135 
Simple extension, 270 
Simple group, 149 
Simple ring, 253 
Simplex, 356 
boundary of, 357 
face of, 357 
Simplicial complex, 358 
Singular matrix, 479 
Skew field, 173 
Smallest subset, 53 
Solvable group, 317 
Solvable polynomial over F, 470 
Space (see topological space) 
Span, 276 
Sphere, 364 
Splitting field, 422 
Square matrix, 477 
determinant of, 46, 479, 480 
main diagonal of, 46, 480 
trace of, 133 


Squaring the circle, 297 
Strictly skew field, 173 
Structure(s) 

binary algebraic, 29 

isomorphic, 30 

isomorphism of, 29 
Structural property, 11, 31 
Subcomplex, 381 

simplicial, 382 
Sub-G-set, 154 
Subfield, 173 

fixed, 418 
Subgroup(s), 50 

commutator, 143, 150 

conjugate, 141, 143 

cyclic, 54, 59 

improper, 51 

index of, 101 

invariant, 141 

isotropy, 157 

join of, 308 

maximal normal, 149 

nontrivial, 51 

normal, 132, 141 

normalizer of, 323 

p-, 322 

proper, 51 

Sylow p-, 221 

torsion, 112 

trivial, 51 
Subnormal series, 311 
Subring, 173 

generated by a, 177 
Subset, 1 

improper, 1 

minimal, 53 

proper, | 

smallest, 53 

upper bound for, 288 
Subspace of a vector space, 281 
Sum 

direct, 105 

of ideals, 254 

of matrices, 477 

modulo 27, 16 

modulo n, 18, 64 
Surface, closed, 371 

one sided, 371 

genus of, 379 
Surjection, 4 
Syllable, 341 
Sylow, Peter Ludvig Mejdell, 324 
Sylow p-subgroup, 325 
Sylow theorems, 324, 325 
Symmetric function, 457 

elementary, 457 
Symmetric group on x letters, 78 
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Symmetric relation, 7 
Symmetries, group of, 79, 114 


Table, group, 43 
Taniyama, Yutaka, 390 
Tartaglia, Niccolo, 471 
Taylor, Richard, 390 
Term ordering, 260 
Thompson, John G., 330 
Topological space(s), 355 
connected, 365 
connected component of, 365 
contractible, 365 
Euler characterictic of, 374 
homeomorphic, 355 
mapping of, 375 
triangulation of, 364 
Torsion coefficient, 113 
Torsion free, 113, 142 
Torsion group, 142 
Torsion subgroup, 112 
Torus, 368 
pinched, 387 
Totally inseparable closure of F 
in E, 447 
Totally inseparable element over F’, 
444 
Totally inseparable extension, 444 
Trace of a matrix, 133 
Trace over F, 455 
Transcendental element over F', 267 
Transcendental number, 268 
Transitive action, 155 
Transitive G-set, 155 
Transitive law, 288 
Transitive relation, 7 
Transitive subgroup of S,, 86 
Transitivity, 229 
Translation, 114 
Transpose of a matrix, 55 
Transposition, 90 
Triangulation, 364 
Trichotomy, 228, 229 
Trisection of an angle, 297 
Trivial homomorphism, 126 
Trivial ideal, 246 
Trivial subgroup, 51 
Two-to-two function, 10 


Union 
of sets, 391 
of G-sets, 160 
Unique factorization domain, 390 
Unit, 173, 389 
Unity, 172 
nth root of, 18, 301 
primitive nth root of, 67, 301 
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Upper bound for a subset, 288 
Upper-triangular matrix, 48 


Van der Waerden, B. L., 419 
Variety, algebraic, 255 
Vector(s), 275, 478 
column, 478 
linear combination of, 276 
linearly dependent over F, 277 
linearly independent over F, 
277 
row, 478 
Vector space(s), 274, 275 
basis for, 278 
dimension over F’, 280 
direct sum of, 281 
finite-dimensional, 277 
isomorphism of, 282 


linear transformation of, 282 
subspace of, 281 
Vertex 
of a digraph, 70 
of a simplex 
Viete, Francois, 198 
Von Dyck, Walter, 38, 81 


Wallpaper group, 117 

Wantzel, Pierre, 298 

Weber, Heinrich, 38, 174, 419 

Wedderburn, Joseph Henry 
Maclagan, 224 

Wedderburn theorem, 226 

Weierstrass, Karl, 266 

Well-defined operation, 25, 137 

Well-defined set, 1 

Weyl], Hermann, 275 


Wey] algebra, 222 
Wiles, Andrew, 390 
Wilson’s theorem, 190 
Word(s), 341 

empty, 341 

reduced, 341 
Word problem, 348 


Zassenhaus, Hans, 313 
Zassenhaus lemma, 314 
Zermelo, Ernst, 289 
Zero 

multiplicity of, 436 

of a polynomial, 204, 255 
Zero divisors, 178 
Zero ring, 172 
Zorn, Max, 289 
Zorn’s lemma, 289 


